Gene Information |
Locus tag | P9301_18041 |
Symbol | thiC |
ID | 4911553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1526173 |
End bp | 1527543 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640161408 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001092028 |
Protein GI | 126697142 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
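The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal Python sketch of that check. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1526173
end_bp = 1527543
gene_length_bp = 1371
protein_length_aa = 456

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and yields the listed protein length.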
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGTT CTTGGATTAA GCCTCGCCTT GGGAAAGACA ATGTAACTCA GATGAACTTT GCGAGAAACG GATATATCAC CGAAGAAATG GATTTTGTTG CTAAAAAAGA GAATCTACCT CCTTCTTTAA TAATGGAGGA AGTGGCAAGA GGAAGATTAA TTATTCCAGC TAATATTAAT CATTTGAATC TTGAGCCAAT GTCTATAGGG GTTGCTTCTC GATGCAAAGT TAATGCCAAT ATTGGTGCTT CTCCCAATGC AAGTGATATA AATGAAGAAG TAGAAAAGCT TAAACTAGCT GTAAAATATG GTGCTGATAC GGTTATGGAT CTTTCTACGG GAGGAGTAAA TTTAGATGAA GTGCGGCAAG CAATTATTCA AGAATCTCCA GTTCCCATAG GAACTGTTCC TGTTTATCAA GCATTAGAAA GTGTACATGG TTCAATCGAT AGACTAACAG AAGACGATTT TCTTCATATT ATTGAAAAAC ATTGCCAGCA AGGAGTAGAT TATCAAACTA TTCATGCTGG TCTATTAATA GAGCATTTAC CAAAAGTTAA AGGAAGAATC ACTGGAATTG TCAGTAGAGG GGGAGGTATT TTAGCTCAAT GGATGCTACA TCATTTTAAG CAAAATCCCC TCTATACAAG GTTTGATGAT ATCTGTGAGA TTTTTAAGAA ATATGATTGT ACTTTTTCTC TAGGAGATTC ACTAAGGCCT GGATGTTTGC ATGATGCTTC TGATGATGCT CAACTAGCTG AATTGAAGAC CTTAGGTGAG CTTACTCGAA GAGCATGGGA ACATAATGTT CAAGTAATGG TTGAGGGCCC TGGTCATGTA CCTATGGACC AAATTGAGTT TAATGTGAGA AAGCAAATGG AAGAATGTTC AGAAGCTCCT TTCTATGTAC TTGGTCCATT AGTTACAGAT ATTTCTCCTG GTTATGACCA TATTTCAAGT GCTATTGGGG CGGCTATGGC AGGATGGTAT GGAACGTCTA TGTTATGTTA CGTAACCCCA AAAGAACATC TAGGCCTCCC AAATGCAGAA GATGTACGAG AAGGATTAAT TGCTTATAAA ATAGCCGCTC ACGCTGCTGA TATAGCAAGA CATAGAGCTG GTGCTCGTGA TAGAGATGAT GAACTTAGTC ATGCAAGGTA TAACTTTGAT TGGAATAAAC AATTCGAACT TTCTTTAGAT CCAGAGAGGG CAAAGCAGTA CCATGATGAA ACACTACCTG AAGAAATCTT TAAAAAGGCT GAGTTTTGTT CAATGTGTGG TCCAAAACAT TGTCCAATGA ATTCAAAGAT TTCAGATGAA TCTCTTGATC AGTTAAAAGA TAAACTTGAA GAATGTAGTA CTTCAGCTTA G
Protein sequence | MRSSWIKPRL GKDNVTQMNF ARNGYITEEM DFVAKKENLP PSLIMEEVAR GRLIIPANIN HLNLEPMSIG VASRCKVNAN IGASPNASDI NEEVEKLKLA VKYGADTVMD LSTGGVNLDE VRQAIIQESP VPIGTVPVYQ ALESVHGSID RLTEDDFLHI IEKHCQQGVD YQTIHAGLLI EHLPKVKGRI TGIVSRGGGI LAQWMLHHFK QNPLYTRFDD ICEIFKKYDC TFSLGDSLRP GCLHDASDDA QLAELKTLGE LTRRAWEHNV QVMVEGPGHV PMDQIEFNVR KQMEECSEAP FYVLGPLVTD ISPGYDHISS AIGAAMAGWY GTSMLCYVTP KEHLGLPNAE DVREGLIAYK IAAHAADIAR HRAGARDRDD ELSHARYNFD WNKQFELSLD PERAKQYHDE TLPEEIFKKA EFCSMCGPKH CPMNSKISDE SLDQLKDKLE ECSTSA
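The relationship between the two sequences can be spot-checked by translating the start and end of the gene sequence with translation table 11. The sketch below is a hedged illustration, not the database's own pipeline: the helper `translate` and the constant names are hypothetical, the gene sequence is assumed to be shown in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand), and only the first three and last three printed blocks are translated.

```python
from itertools import product

# NCBI translation tables list codons with bases in T, C, A, G order.
BASES = "TCAG"
# Amino-acid assignments of table 11. They are identical to the standard
# code; the two tables differ only in which start codons are permitted.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence; spaces between blocks are ignored."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First three and last three blocks of the gene sequence as printed above.
head = "ATGAGAAGTT CTTGGATTAA GCCTCGCCTT"
tail = "GAATGTAGTA CTTCAGCTTA G"

print(translate(head))  # compare with the start of the protein sequence
print(translate(tail))  # compare with the end; "*" marks the stop codon
```

The same function can be applied to the full gene sequence to compare every residue with the listed protein.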