Gene P9211_17361 details


Gene Information

Locus tag: P9211_17361
Symbol: thiC
ID: 5730933
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1563504
End bp: 1564883
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 41%
IMG OID: 641286121
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_001551621
Protein GI: 159904277
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
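
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length counts one terminal stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1563504, 1564883)
print(length_bp)                  # 1380
print(protein_length(length_bp))  # 459
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of this record.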


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGCTT CATGGGTTGC TAACAGGCAA GGCAAAAGCA ATGTTTCTCA ATTGCATTTT 
GCCCGACAAG GCATGATTAC TGAAGAGATG GCGTATGTAG CCAATAGAGA AAATCTTCCT
GAGTCTTTAG TCATGGAAGA AGTTGCTCGA GGCCGCATGA TAATTCCAGC GAATATTAAT
CATCTAAATT TGGAACCTAT GGCTATTGGT ATTGCTTCCA AATGCAAAGT TAATGCCAAT
ATTGGGGCCT CTCCTAATGC AAGTGATGTA GGTGAAGAGC TGAAAAAGCT TGAATTAGCT
GTTAAGTATG GAGCTGATAC CGTGATGGAC CTTTCTACTG GAGGGGTCAA TCTAGATGAA
GTGCGAACTG CAATTATCAA TGCATCTCCA GTACCTATTG GAACTGTTCC TGTTTATCAA
GCTTTAGAAA GTGTTCATGG ATCTATTGAA AAATTATCAG AGGAAGATTT CCTTCACATA
ATTGAGAAGC ATTGCCAGCA AGGCGTTGAT TATCAAACTA TTCATGCTGG TTTACTTATA
GAGCATCTAC CCAAGGTTAA AGGAAGATTA ACTGGGATAG TTAGTCGCGG TGGCGGCATC
CTGGCTCAAT GGATGCTCTA TCACCACAAA CAAAATCCTT TATTCTCTAG ATTTGATGAT
ATTTGTGAGA TTTTCAAGCG ATATGATTGC AGTTTTTCAC TAGGAGACTC TCTTCGCCCA
GGGTGTTTGC ATGATGCTTC TGATGAGGCT CAATTGGCTG AATTGAAAAC TTTGGGCCAG
TTAACTAAAC GTGCCTGGGC TCATGATATT CAAGTAATGG TTGAAGGACC GGGCCATGTG
CCGATGGATC AGATTGAATT TAATGTTCGG AAACAGATGG AGGATTGTTC GGAAGCACCA
TTTTATGTTT TAGGCCCTTT GGTTACCGAT ATAGCACCTG GTTATGACCA CATCACAAGC
GCTATTGGAG CTGCAATGGC TGGTTGGTAT GGCACCGCAA TGCTTTGTTA TGTGACACCT
AAAGAACATC TCGGACTACC AAACCCTGAG GATGTTCGAG AAGGATTAAT CGCCTATAAA
ATTGCTGCTC ATGCTGCAGA TATAGCTAGA CATCGTTCTG GTGCAAGAGA TAGAGATGAT
GAACTTAGTA AAGCTAGATA TGCTTTTGAT TGGAACAAGC AATTTGAATT ATCCCTTGAT
CCAGAGAGGG CTCGTCAATA TCATGATGAA ACTCTTCCTG CAGATATATA TAAACAAGCA
GAGTTTTGTT CAATGTGTGG TCCCAAGCAT TGCCCTATGC AAACTAAGAT TACGGATAAA
GATTTAGATG ATCTCGAGGA TGTAATTAAA TCAAAAGATG CCTCTAAAAT AAATCTATAA
 
Protein sequence
MRASWVANRQ GKSNVSQLHF ARQGMITEEM AYVANRENLP ESLVMEEVAR GRMIIPANIN 
HLNLEPMAIG IASKCKVNAN IGASPNASDV GEELKKLELA VKYGADTVMD LSTGGVNLDE
VRTAIINASP VPIGTVPVYQ ALESVHGSIE KLSEEDFLHI IEKHCQQGVD YQTIHAGLLI
EHLPKVKGRL TGIVSRGGGI LAQWMLYHHK QNPLFSRFDD ICEIFKRYDC SFSLGDSLRP
GCLHDASDEA QLAELKTLGQ LTKRAWAHDI QVMVEGPGHV PMDQIEFNVR KQMEDCSEAP
FYVLGPLVTD IAPGYDHITS AIGAAMAGWY GTAMLCYVTP KEHLGLPNPE DVREGLIAYK
IAAHAADIAR HRSGARDRDD ELSKARYAFD WNKQFELSLD PERARQYHDE TLPADIYKQA
EFCSMCGPKH CPMQTKITDK DLDDLEDVIK SKDASKINL
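
The gene and protein sequences above can be related by translating the nucleotide sequence codon by codon. The following is a minimal sketch, assuming the 10-character grouping and line breaks are presentational only, and that the amino acid assignments of translation table 11 match the standard code (the two tables are understood to differ only in permitted start codons). The names `clean`, `gc_percent`, and `translate` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to apply to table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    bases = clean(seq)
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGAGAGCTT CATGGGTTGC TAACAGGCAA GGCAAAAGCA ATGTTTCTCA ATTGCATTTT"
print(translate(first_line))  # MRASWVANRQGKSNVSQLHF
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.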