darthtrader

Data Mining with R and Rapidminer

Since the thread where brownsfan and I posted about Steenbarger and his volume/day's range stuff, I've stumbled across some AMAZING stuff in this area. To program a lot of this in stock software is to reinvent the wheel.

 

This interesting thread came up on Elite that mentions Rapidminer.

http://elitetrader.com/vb/showthread.php?s=&threadid=117361

 

In trying to find information about it, I finally installed R, the heavyweight open source statistical computing environment, and actually loaded some YM data into it. It's amazing how simple it makes tasks that are far beyond most stock software, analysis-wise.

 

Here is an entire course on data mining with R:

http://www.stats202.com/

The videos for the class are all up on Google Video if you search for "data mining" + long format.

 

This is an entire free book on data mining with R. It has 2 different hands-on projects, and one is a forecast of IBM stock prices using neural nets.

http://www.liaad.up.pt/~ltorgo/DataMiningWithR/

 

This has a 5-part video tutorial on using Rapidminer:

http://www.neuralmarkettrends.com/tutorials/

 

Jerry in that Elite thread had a project idea for Rapidminer, and I just sent him a message inviting him to bring it to TL. The premium section here would be quite an ideal place for such results.

 

Is anyone else interested in this stuff? While I'm not interested in "prediction", it just seems like a huge waste of time not to use these tools to find relationships that would take years of experience to uncover (if ever).
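
To make "find relationships" a bit more concrete, here is a minimal sketch of the simplest version of the volume/day's range idea: measure how strongly daily volume and the day's range move together. It is written in Python rather than R just to keep it self-contained, and it runs on made-up bars, not real YM data. The function names (`pearson`, `make_synthetic_bars`) and every number in the generator are assumptions for illustration only, including the built-in link between volume and range.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)


def make_synthetic_bars(n_days, seed=0):
    """Hypothetical daily bars as (high, low, volume). Not real market data."""
    rng = random.Random(seed)
    bars = []
    for _ in range(n_days):
        volume = rng.uniform(50_000, 200_000)
        # Assumption for illustration only: range grows with volume, plus noise.
        day_range = max(40 + volume / 2_000 + rng.gauss(0, 15), 1.0)
        low = 12_000 + rng.uniform(-300, 300)
        bars.append((low + day_range, low, volume))
    return bars


bars = make_synthetic_bars(250)
ranges = [high - low for high, low, _ in bars]
volumes = [volume for _, _, volume in bars]
r = pearson(volumes, ranges)
print(f"volume vs. day's range correlation: {r:.2f}")
```

With real data you would replace `make_synthetic_bars` with whatever your data provider exports. The correlation it prints here only reflects the relationship that was deliberately built into the fake bars.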


Well, I have already given up on Rapidminer. While it's extremely interesting, documentation-wise there is simply no way someone with no background in stats/data mining is going to be able to get up to speed with it.

R is a different story, with entire series of textbooks written on its use, from introductory stats to extremely complex stuff.

 

I've just ordered this textbook, which sounded like a good way to get up to speed on the various mining algorithms:

Data Mining: Concepts and Techniques, 2nd ed

http://www.amazon.com/Data-Mining-Second-Techniques-Management/dp/1558609016/ref=pd_sim_b_3

 

I'm still trying to figure out what R book to go with, but this one sounds pretty nice. It focuses on working code and available data sets to learn R with.

http://www.amazon.com/Statistics-Introduction-Michael-J-Crawley/dp/0470022981/ref=pd_bbs_sr_1?ie=UTF8&s=books&qid=1209041520&sr=1-1


Just some general R resources before I lose them.

 

RExcel is probably the way to go at this level. You can basically either use R within Excel or read and write Excel files directly in R. With this, you should be able to get R doing real-time market stuff through a data provider's DDE link.

http://sunsite.univie.ac.at/rcom/server/doc/RExcel.html

 

Good R learning resources

http://www.mayin.org/ajayshah/KB/R/index.html

http://www.agr.kuleuven.ac.be/vakken/statisticsbyR/

 

Good financial stuff in R

http://www.burns-stat.com/

 

Charts for R

http://addictedtor.free.fr/graphiques/RGraphGallery.php?graph=65

 

Financial engineering stuff

http://www.rmetrics.org/


OK - new to data mining (although not new to the concept, or to the markets). Is there anything online you can recommend for me if I'm looking to really start from scratch? I can program in VBA, so I'm sure I could get my head round the code if I try.

 

Any / all answers appreciated. I have a tactical relative value trading product here which I am spearheading (in between doing a grillion other things as per usual), so I might be getting involved in this a bit. I've been looking at all sorts of options, FX engines etc. But while there are plenty of products out there to backtest stuff for you if you are specific enough, ultimately what I really think I have to do is the data mining myself.

 

And for both cost and time reasons I'm thinking it might be easier to actually do it myself (or at least supervise while one of our junior traders does the grunt work) than to pay for an engineer to come in and do it. I'm not 100% sure what I'm looking for, I just have a few ideas I want to look at right now, and that's never an efficient use of outside IT-type people in my experience.

 

Sorry this post is a bit rambling - busy afternoon on the desk, so I keep coming back and adding a bit.

 

GJ


I'm actually looking to do some data mining myself soon. I have JMP 7 by SAS. It's an extraordinary piece of software for mining large amounts of data:

 

http://www.jmp.com/software/jmp7/

 

I use Investor RT, and the only problem I'm having at the moment is figuring out how to extract 5 years' worth of Market Profile pivot data for the E-Mini S&P. If anyone can help with this, I can hopefully post some results for people in the premium forum.

 

Cheers.

 

:)
