Gene OSTLU_119522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119522 
SymbolAcr2 
ID5000475 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp479209 
End bp482789 
Gene Length3581 bp 
Protein Length1145 aa 
Translation table 
GC content46% 
IMG OID640415896 
ProductDNA-directed RNA polymerase I polypeptide 2 
Protein accessionXP_001416405 
Protein GI145343599 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.050944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTCT TGAACGCCTC TGTACGCCGT GGAAGATTGT TGCGTATACA CCACACCATC 
CTACACGAGT TCGACAATCG TTCTGTAGCG AAAGAGGATG CGTAACGCAA ACTCGCCAGA
AATAACAGGC GACTCGGTTC CTTCCGAGGT ACCAATAAAG AAGTTGTTTG CTTGTCACGT
AGACTCGTTT AATCACCTCA CCAATCTGGG ATTCGATCAG ATATTGCGAT GTGTGAAACC
TGAAAAGTTC CAACAATCGG CTGGCTGTCC AGAACTCATG CTATGGCTTG AGGACTTGAA
ATTGGAGTCA CTTAAATTTC GCCATCGCAT TTCGGGACGC ACGGAGAGTC AAAAGACGCC
CAGAGGTTGT CGTGAAGGAG GTGAAAGTTA CAAGGCGCCA CTCTCAGTTC AGTTTTGTTG
GCAAATAGAT GGAGAAAATG TGCAGAGACG CGTTGTAAAT CTCGGAGATT GTCCAGTGAT
GGTCAAGTCT GATGTTTGCT CGCTCGCACT ACTTTCTCCA GCACAACTAG TGGCACGTGG
TGAAGAAGCA CACGAAGCAG GAGGATATTT CATCATCAAT GGTATCGAAC GAATGATTCG
TATGATCATA CAACAACGGA GGCATCATAT TCTTGGACTC TGCCGACGAG CTTTTACGAA
ACGCTCACCT TTGTTTTCTG AGTTCGCTAC TGTCATTCGC TGTGTCACTG AGGATGAGCA
CTCGTCTATA GTTCGCTTGC ACTATATGCG AACCGGTGCG CTACGTCTGG CTCTCCAGCA
TCGACGACAA GAGTTTTTCA TACCAGCTGG CATAGTATTA CGGGCACTTG CAGTATGCTC
TGACGCTGAG ATGTATCGGC AAGTGTGCCT GCATTTGCGA AGCGCTGGAA CTGATGAAAC
CTTCATTGAG GACCGTTTGG CTCTACTTCA ACGAGAATGT CATGAACTTC AAATACGAAC
GCAAATGTGT GCGTTATCTT ACTTAGGACA GCATTTTCGT ACACTTTTAG ATCTTTCTTC
TGACGAAAGT GATATTTCTG TCGGTGAGCG CTTTCTCGAA GATTTTATTT ATGTTCATCT
TCAACATAAT GGTGACAAAT TGTCGTTGCT GATTCTCATG TTGAGTAAGC TTCTGTCGAT
CGTGACGGGT CGGTGTAGTC CCGACGACCC GGACTCGCTA GTCAATCAAG AAGTCCTGGT
GCCCGGCCTC CTCCTTCAAG CTATGATTCG TGAGAAGGTG CGGATAGCTT TCCAGAAGGT
AGTGACGCAT CTTCGACGCA CACAGGGCAG CTGGAGTGAG GAGACTATTT CGCATCTCAT
CAATGAATCT GGCTCTAGTG ATGTAGGGAA AGTAGTGGAA TATTTCCTAG CCACCGGGAA
CTTAGTTAGT CCAACTGGGC TCGGTTTAAG CCAAACAAGT GGTTTTACTA TTGTTGCAGA
GAAGCTAAAT TACTTCAGGT ATATTTCACA CTTTCGCTCC GTACATAGAG GAGCGTATTT
TATGGAACTA CGCACGACAA CTGTTCGTAA GCTCTTACCC GAATCATGGG GCTTCTTATG
CCCAGTACAT ACCCCAGATG GATCACCTTG TGGTTTACTG AATCATTTGG CAGAGATGTG
TGAAATTGTC ATGCCTGACA CAGATCATGT GTTTCAACAA AAGCGTTTAC TGCAGATACA
CTCGGTACTG GATAGGGCTC TTATTAGTGT AGAACAATAT GACTGTGGAT ACGCGCATGT
TCCAGTCGTG CTGGAAGGTT CGTTTGTAGG CTATATCTCG GCAGAAAATG CATCCCATGT
GATTTCTGCT CTGAGAGCGT TTAAGGTGAC CTTGACAACG TCAGTTTTGC GCATGAGTGA
AATCTCTTAT ATTCATCCCG GTGGTGAACA TGGTTTGTTT CCAGGTTTGT ATATTTTCTA
TGGTCCTTCT AGGTTGATGA GACCAGTGAA GCAAGTAGAG AGCATGAAAG TTGAGTTCAT
CGGTACACTT GAACAAGCCT TCCTTTCTAT TTCTGCACAT CAAGTAGAAA GTCACAATGC
ATCATATACG CACGCAGAGA TTAGAAACAC TTCTGCGCTA AGCTGCGTCG CAAGCTTAAC
GCCTTGGTCC GACTTCAATC AGAGTCCCCG TAATATGTAT CAGTGCCAGA TGGCCAAGCA
GACGATGGGA ACACCAATGC ACACTATTTG CTACCGGAGT GATACCAAAC TTTACCGTTT
ACATACTCCG CAGCGACCCT TAGCACTAAC ATGCACTTAC GACAAATATT CGCTAGACGA
TTATGCGCTT GGCACAAATG CAGTTGTAGC CGTCATTGCA TACACTGGAT ACGACATGGA
AGACGCAATG ATTGTGAACA AGGGTTCGCT CCATCGCGGT TTCGCACATG CCACATTATA
CAAAACGCTT GTTGAGGGTA TATCAGCAAA TGAGACATTA AGTCGGAGGG ATAACACCTA
TACCAGTGAA AATAAGCAGC TCGACGGTAC TGGCTCTGTG CAGCTGGGCT CTATTGTTCG
GCCAGGCGAC ACACTTCTAA ATTTACACAG CTCAGATGGT GTAACCAAGG GTAGATCGAT
TCGTCTTCGA GGAACAGACG CAGCGGTAGT AGATAAGGTT GTGCTAACTC AGTCTGTTAA
GCAGGCTGCG ACGAAGAATG ACAAACGTGC TGCTATTACT CTTCGTTATG ATAGAAATCC
AGTCATAGGG GATAAGTTCA GCAGTCGTCA TGGACAAAAG GGCGTTCTCA GCTTCCTGTG
GCCAGAAGAA GATATGCCTT TTAGTGACCG CACAGGTCTT CGACCCGACG TTATCATCAA
TCCGCATGCC TTTCCGTCGC GAATGACTAT TGGTATGCTT GTTGAGAGTT TGGCTGCCAA
AGCTGGTGCT AGCACAGGAA TTTTTGCGGA TGCAACTCCT TTCAAGCATA GTGACAAGGA
GATTTCACCC ACAGAAGAAT ATGGCAAGTT GCTTCGAGAA AGTGGCTACA ATTTCTGTGG
TAGTGAACGT TTGGTCAATG GTTGTACAGG TGAGAGCTTC AGTGTTGATA TTTTTATTGG
TCTCGTGTAC TACCAACGTC TGCGCCATAT GGTGAGTCTT TTTCATACTT TTCATGGTTC
TCTCATTACA AGTAACTAGG TGAGCGATAA ATTTCAAGTG CGGTCAACTG GCCCAAACAA
TCCTCTCACG ATGCAGCCGA TCAAAGGAAG AAAGTCAGGT GGTGGTATAC GTTTCGGTGA
AATGGAACGC GATTCACTCC TTGCCCATGG TGTAGCATAC TTGGTGAGCT TGTTTGGATT
TATTCCGTCG TAAACTCTAA CACATTTAGC AGCTGCGTGA TAGGTTGCAT ATCTGCTCAG
ATAATCGAGA TGCTTGGGTT TGTAATATGT GTGGTAGCTT GATAGCCCCT TTGACATGCG
TGGCGTCTAT CCGTACTGAC GAATACTCGT CTCGCAGAAC GAGTTGTCGG GTCTGTGATT
CAGCACACAA ACTTGAACGC ATTTCCATTC CACATGTTTT CATTTATCTT ACGGCAGAGC
TTGCGGCGAT GAACATATCA GTACAAGTCA AAGCGAAGTA A
 
Protein sequence
MRFLNASVRR GRLLRIHHTI LHEFDNRDSV PSEVPIKKLF ACHVDSFNHL TNLGFDQILR 
CVKPEKFQQS AGCPELMLWL EDLKLESLKF RHRISGRTES QKTPRGCREG GESYKAPLSV
QFCWQIDGEN VQRRVVNLGD CPVMVKSDVC SLALLSPAQL VARGEEAHEA GGYFIINGIE
RMIRMIIQQR RHHILGLCRR AFTKRSPLFS EFATVIRCVT EDEHSSIVRL HYMRTGALRL
ALQHRRQEFF IPAGIVLRAL AVCSDAEMYR QVCLHLRSAG TDETFIEDRL ALLQRECHEL
QIRTQMCALS YLGQHFRTLL DLSSDESDIS VGERFLEDFI YVHLQHNGDK LSLLILMLSK
LLSIVTGRCS PDDPDSLVNQ EVLVPGLLLQ AMIREKVRIA FQKVVTHLRR TQGSWSEETI
SHLINESGSS DVGKVVEYFL ATGNLVSPTG LGLSQTSGFT IVAEKLNYFR YISHFRSVHR
GAYFMELRTT TVRKLLPESW GFLCPVHTPD GSPCGLLNHL AEMCEIVMPD TDHVFQQKRL
LQIHSVLDRA LISVEQYDCG YAHVPVVLEG SFVGYISAEN ASHVISALRA FKVTLTTSVL
RMSEISYIHP GGEHGLFPGL YIFYGPSRLM RPVKQVESMK VEFIGTLEQA FLSISAHQVE
SHNASYTHAE IRNTSALSCV ASLTPWSDFN QSPRNMYQCQ MAKQTMGTPM HTICYRSDTK
LYRLHTPQRP LALTCTYDKY SLDDYALGTN AVVAVIAYTG YDMEDAMIVN KGSLHRGFAH
ATLYKTLVEG ISANETLSRR DNTYTSENKQ LDGTGSVQLG SIVRPGDTLL NLHSSDGVTK
GRSIRLRGTD AAVVDKVVLT QSVKQAATKN DKRAAITLRY DRNPVIGDKF SSRHGQKGVL
SFLWPEEDMP FSDRTGLRPD VIINPHAFPS RMTIGMLVES LAAKAGASTG IFADATPFKH
SDKEISPTEE YGKLLRESGY NFCGSERLVN GCTGESFSVD IFIGLVYYQR LRHMVSLFHT
FHVRSTGPNN PLTMQPIKGR KSGGGIRFGE MERDSLLAHG VAYLVSLLHI CSDNRDAWVC
NMCGSLIAPL TCVASIRTDE YSSRRTSCRV CDSAHKLERI SIPHVFIYLT AELAAMNISV
QVKAK