Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_119522 |
Symbol | Acr2 |
ID | 5000475 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 479209 |
End bp | 482789 |
Gene Length | 3581 bp |
Protein Length | 1145 aa |
Translation table | |
GC content | 46% |
IMG OID | 640415896 |
Product | DNA-directed RNA polymerase I polypeptide 2 |
Protein accession | XP_001416405 |
Protein GI | 145343599 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.050944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTCT TGAACGCCTC TGTACGCCGT GGAAGATTGT TGCGTATACA CCACACCATC CTACACGAGT TCGACAATCG TTCTGTAGCG AAAGAGGATG CGTAACGCAA ACTCGCCAGA AATAACAGGC GACTCGGTTC CTTCCGAGGT ACCAATAAAG AAGTTGTTTG CTTGTCACGT AGACTCGTTT AATCACCTCA CCAATCTGGG ATTCGATCAG ATATTGCGAT GTGTGAAACC TGAAAAGTTC CAACAATCGG CTGGCTGTCC AGAACTCATG CTATGGCTTG AGGACTTGAA ATTGGAGTCA CTTAAATTTC GCCATCGCAT TTCGGGACGC ACGGAGAGTC AAAAGACGCC CAGAGGTTGT CGTGAAGGAG GTGAAAGTTA CAAGGCGCCA CTCTCAGTTC AGTTTTGTTG GCAAATAGAT GGAGAAAATG TGCAGAGACG CGTTGTAAAT CTCGGAGATT GTCCAGTGAT GGTCAAGTCT GATGTTTGCT CGCTCGCACT ACTTTCTCCA GCACAACTAG TGGCACGTGG TGAAGAAGCA CACGAAGCAG GAGGATATTT CATCATCAAT GGTATCGAAC GAATGATTCG TATGATCATA CAACAACGGA GGCATCATAT TCTTGGACTC TGCCGACGAG CTTTTACGAA ACGCTCACCT TTGTTTTCTG AGTTCGCTAC TGTCATTCGC TGTGTCACTG AGGATGAGCA CTCGTCTATA GTTCGCTTGC ACTATATGCG AACCGGTGCG CTACGTCTGG CTCTCCAGCA TCGACGACAA GAGTTTTTCA TACCAGCTGG CATAGTATTA CGGGCACTTG CAGTATGCTC TGACGCTGAG ATGTATCGGC AAGTGTGCCT GCATTTGCGA AGCGCTGGAA CTGATGAAAC CTTCATTGAG GACCGTTTGG CTCTACTTCA ACGAGAATGT CATGAACTTC AAATACGAAC GCAAATGTGT GCGTTATCTT ACTTAGGACA GCATTTTCGT ACACTTTTAG ATCTTTCTTC TGACGAAAGT GATATTTCTG TCGGTGAGCG CTTTCTCGAA GATTTTATTT ATGTTCATCT TCAACATAAT GGTGACAAAT TGTCGTTGCT GATTCTCATG TTGAGTAAGC TTCTGTCGAT CGTGACGGGT CGGTGTAGTC CCGACGACCC GGACTCGCTA GTCAATCAAG AAGTCCTGGT GCCCGGCCTC CTCCTTCAAG CTATGATTCG TGAGAAGGTG CGGATAGCTT TCCAGAAGGT AGTGACGCAT CTTCGACGCA CACAGGGCAG CTGGAGTGAG GAGACTATTT CGCATCTCAT CAATGAATCT GGCTCTAGTG ATGTAGGGAA AGTAGTGGAA TATTTCCTAG CCACCGGGAA CTTAGTTAGT CCAACTGGGC TCGGTTTAAG CCAAACAAGT GGTTTTACTA TTGTTGCAGA GAAGCTAAAT TACTTCAGGT ATATTTCACA CTTTCGCTCC GTACATAGAG GAGCGTATTT TATGGAACTA CGCACGACAA CTGTTCGTAA GCTCTTACCC GAATCATGGG GCTTCTTATG CCCAGTACAT ACCCCAGATG GATCACCTTG TGGTTTACTG AATCATTTGG CAGAGATGTG TGAAATTGTC ATGCCTGACA CAGATCATGT GTTTCAACAA AAGCGTTTAC TGCAGATACA CTCGGTACTG GATAGGGCTC TTATTAGTGT AGAACAATAT GACTGTGGAT ACGCGCATGT TCCAGTCGTG CTGGAAGGTT CGTTTGTAGG CTATATCTCG GCAGAAAATG CATCCCATGT GATTTCTGCT CTGAGAGCGT TTAAGGTGAC CTTGACAACG TCAGTTTTGC GCATGAGTGA AATCTCTTAT ATTCATCCCG GTGGTGAACA TGGTTTGTTT CCAGGTTTGT ATATTTTCTA TGGTCCTTCT AGGTTGATGA GACCAGTGAA GCAAGTAGAG AGCATGAAAG TTGAGTTCAT CGGTACACTT GAACAAGCCT TCCTTTCTAT TTCTGCACAT CAAGTAGAAA GTCACAATGC ATCATATACG CACGCAGAGA TTAGAAACAC TTCTGCGCTA AGCTGCGTCG CAAGCTTAAC GCCTTGGTCC GACTTCAATC AGAGTCCCCG TAATATGTAT CAGTGCCAGA TGGCCAAGCA GACGATGGGA ACACCAATGC ACACTATTTG CTACCGGAGT GATACCAAAC TTTACCGTTT ACATACTCCG CAGCGACCCT TAGCACTAAC ATGCACTTAC GACAAATATT CGCTAGACGA TTATGCGCTT GGCACAAATG CAGTTGTAGC CGTCATTGCA TACACTGGAT ACGACATGGA AGACGCAATG ATTGTGAACA AGGGTTCGCT CCATCGCGGT TTCGCACATG CCACATTATA CAAAACGCTT GTTGAGGGTA TATCAGCAAA TGAGACATTA AGTCGGAGGG ATAACACCTA TACCAGTGAA AATAAGCAGC TCGACGGTAC TGGCTCTGTG CAGCTGGGCT CTATTGTTCG GCCAGGCGAC ACACTTCTAA ATTTACACAG CTCAGATGGT GTAACCAAGG GTAGATCGAT TCGTCTTCGA GGAACAGACG CAGCGGTAGT AGATAAGGTT GTGCTAACTC AGTCTGTTAA GCAGGCTGCG ACGAAGAATG ACAAACGTGC TGCTATTACT CTTCGTTATG ATAGAAATCC AGTCATAGGG GATAAGTTCA GCAGTCGTCA TGGACAAAAG GGCGTTCTCA GCTTCCTGTG GCCAGAAGAA GATATGCCTT TTAGTGACCG CACAGGTCTT CGACCCGACG TTATCATCAA TCCGCATGCC TTTCCGTCGC GAATGACTAT TGGTATGCTT GTTGAGAGTT TGGCTGCCAA AGCTGGTGCT AGCACAGGAA TTTTTGCGGA TGCAACTCCT TTCAAGCATA GTGACAAGGA GATTTCACCC ACAGAAGAAT ATGGCAAGTT GCTTCGAGAA AGTGGCTACA ATTTCTGTGG TAGTGAACGT TTGGTCAATG GTTGTACAGG TGAGAGCTTC AGTGTTGATA TTTTTATTGG TCTCGTGTAC TACCAACGTC TGCGCCATAT GGTGAGTCTT TTTCATACTT TTCATGGTTC TCTCATTACA AGTAACTAGG TGAGCGATAA ATTTCAAGTG CGGTCAACTG GCCCAAACAA TCCTCTCACG ATGCAGCCGA TCAAAGGAAG AAAGTCAGGT GGTGGTATAC GTTTCGGTGA AATGGAACGC GATTCACTCC TTGCCCATGG TGTAGCATAC TTGGTGAGCT TGTTTGGATT TATTCCGTCG TAAACTCTAA CACATTTAGC AGCTGCGTGA TAGGTTGCAT ATCTGCTCAG ATAATCGAGA TGCTTGGGTT TGTAATATGT GTGGTAGCTT GATAGCCCCT TTGACATGCG TGGCGTCTAT CCGTACTGAC GAATACTCGT CTCGCAGAAC GAGTTGTCGG GTCTGTGATT CAGCACACAA ACTTGAACGC ATTTCCATTC CACATGTTTT CATTTATCTT ACGGCAGAGC TTGCGGCGAT GAACATATCA GTACAAGTCA AAGCGAAGTA A
|
Protein sequence | MRFLNASVRR GRLLRIHHTI LHEFDNRDSV PSEVPIKKLF ACHVDSFNHL TNLGFDQILR CVKPEKFQQS AGCPELMLWL EDLKLESLKF RHRISGRTES QKTPRGCREG GESYKAPLSV QFCWQIDGEN VQRRVVNLGD CPVMVKSDVC SLALLSPAQL VARGEEAHEA GGYFIINGIE RMIRMIIQQR RHHILGLCRR AFTKRSPLFS EFATVIRCVT EDEHSSIVRL HYMRTGALRL ALQHRRQEFF IPAGIVLRAL AVCSDAEMYR QVCLHLRSAG TDETFIEDRL ALLQRECHEL QIRTQMCALS YLGQHFRTLL DLSSDESDIS VGERFLEDFI YVHLQHNGDK LSLLILMLSK LLSIVTGRCS PDDPDSLVNQ EVLVPGLLLQ AMIREKVRIA FQKVVTHLRR TQGSWSEETI SHLINESGSS DVGKVVEYFL ATGNLVSPTG LGLSQTSGFT IVAEKLNYFR YISHFRSVHR GAYFMELRTT TVRKLLPESW GFLCPVHTPD GSPCGLLNHL AEMCEIVMPD TDHVFQQKRL LQIHSVLDRA LISVEQYDCG YAHVPVVLEG SFVGYISAEN ASHVISALRA FKVTLTTSVL RMSEISYIHP GGEHGLFPGL YIFYGPSRLM RPVKQVESMK VEFIGTLEQA FLSISAHQVE SHNASYTHAE IRNTSALSCV ASLTPWSDFN QSPRNMYQCQ MAKQTMGTPM HTICYRSDTK LYRLHTPQRP LALTCTYDKY SLDDYALGTN AVVAVIAYTG YDMEDAMIVN KGSLHRGFAH ATLYKTLVEG ISANETLSRR DNTYTSENKQ LDGTGSVQLG SIVRPGDTLL NLHSSDGVTK GRSIRLRGTD AAVVDKVVLT QSVKQAATKN DKRAAITLRY DRNPVIGDKF SSRHGQKGVL SFLWPEEDMP FSDRTGLRPD VIINPHAFPS RMTIGMLVES LAAKAGASTG IFADATPFKH SDKEISPTEE YGKLLRESGY NFCGSERLVN GCTGESFSVD IFIGLVYYQR LRHMVSLFHT FHVRSTGPNN PLTMQPIKGR KSGGGIRFGE MERDSLLAHG VAYLVSLLHI CSDNRDAWVC NMCGSLIAPL TCVASIRTDE YSSRRTSCRV CDSAHKLERI SIPHVFIYLT AELAAMNISV QVKAK
|
| |