Gene Information |
Locus tag | P9211_17361 |
Symbol | thiC |
ID | 5730933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1563504 |
End bp | 1564883 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641286121 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001551621 |
Protein GI | 159904277 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
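The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 1563504
END_BP = 1564883
GENE_LENGTH_BP = 1380
PROTEIN_LENGTH_AA = 459


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (an assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under those assumptions, 1564883 - 1563504 + 1 gives 1380 bp, and 1380 / 3 - 1 gives 459 aa, matching the table.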
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCTT CATGGGTTGC TAACAGGCAA GGCAAAAGCA ATGTTTCTCA ATTGCATTTT GCCCGACAAG GCATGATTAC TGAAGAGATG GCGTATGTAG CCAATAGAGA AAATCTTCCT GAGTCTTTAG TCATGGAAGA AGTTGCTCGA GGCCGCATGA TAATTCCAGC GAATATTAAT CATCTAAATT TGGAACCTAT GGCTATTGGT ATTGCTTCCA AATGCAAAGT TAATGCCAAT ATTGGGGCCT CTCCTAATGC AAGTGATGTA GGTGAAGAGC TGAAAAAGCT TGAATTAGCT GTTAAGTATG GAGCTGATAC CGTGATGGAC CTTTCTACTG GAGGGGTCAA TCTAGATGAA GTGCGAACTG CAATTATCAA TGCATCTCCA GTACCTATTG GAACTGTTCC TGTTTATCAA GCTTTAGAAA GTGTTCATGG ATCTATTGAA AAATTATCAG AGGAAGATTT CCTTCACATA ATTGAGAAGC ATTGCCAGCA AGGCGTTGAT TATCAAACTA TTCATGCTGG TTTACTTATA GAGCATCTAC CCAAGGTTAA AGGAAGATTA ACTGGGATAG TTAGTCGCGG TGGCGGCATC CTGGCTCAAT GGATGCTCTA TCACCACAAA CAAAATCCTT TATTCTCTAG ATTTGATGAT ATTTGTGAGA TTTTCAAGCG ATATGATTGC AGTTTTTCAC TAGGAGACTC TCTTCGCCCA GGGTGTTTGC ATGATGCTTC TGATGAGGCT CAATTGGCTG AATTGAAAAC TTTGGGCCAG TTAACTAAAC GTGCCTGGGC TCATGATATT CAAGTAATGG TTGAAGGACC GGGCCATGTG CCGATGGATC AGATTGAATT TAATGTTCGG AAACAGATGG AGGATTGTTC GGAAGCACCA TTTTATGTTT TAGGCCCTTT GGTTACCGAT ATAGCACCTG GTTATGACCA CATCACAAGC GCTATTGGAG CTGCAATGGC TGGTTGGTAT GGCACCGCAA TGCTTTGTTA TGTGACACCT AAAGAACATC TCGGACTACC AAACCCTGAG GATGTTCGAG AAGGATTAAT CGCCTATAAA ATTGCTGCTC ATGCTGCAGA TATAGCTAGA CATCGTTCTG GTGCAAGAGA TAGAGATGAT GAACTTAGTA AAGCTAGATA TGCTTTTGAT TGGAACAAGC AATTTGAATT ATCCCTTGAT CCAGAGAGGG CTCGTCAATA TCATGATGAA ACTCTTCCTG CAGATATATA TAAACAAGCA GAGTTTTGTT CAATGTGTGG TCCCAAGCAT TGCCCTATGC AAACTAAGAT TACGGATAAA GATTTAGATG ATCTCGAGGA TGTAATTAAA TCAAAAGATG CCTCTAAAAT AAATCTATAA
|
Protein sequence | MRASWVANRQ GKSNVSQLHF ARQGMITEEM AYVANRENLP ESLVMEEVAR GRMIIPANIN HLNLEPMAIG IASKCKVNAN IGASPNASDV GEELKKLELA VKYGADTVMD LSTGGVNLDE VRTAIINASP VPIGTVPVYQ ALESVHGSIE KLSEEDFLHI IEKHCQQGVD YQTIHAGLLI EHLPKVKGRL TGIVSRGGGI LAQWMLYHHK QNPLFSRFDD ICEIFKRYDC SFSLGDSLRP GCLHDASDEA QLAELKTLGQ LTKRAWAHDI QVMVEGPGHV PMDQIEFNVR KQMEDCSEAP FYVLGPLVTD IAPGYDHITS AIGAAMAGWY GTAMLCYVTP KEHLGLPNPE DVREGLIAYK IAAHAADIAR HRSGARDRDD ELSKARYAFD WNKQFELSLD PERARQYHDE TLPADIYKQA EFCSMCGPKH CPMQTKITDK DLDDLEDVIK SKDASKINL
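The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that it contains only the bases A, C, G and T. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, so the standard codon layout is used here. The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGAGAGCTT CATGGGTTGC TAACAGGCAA"))  # MRASWVANRQ
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once its spaces are removed, and `gc_content` on the same input can be compared with the GC content value in the Gene Information table.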