Gene P9211_13191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_13191 
SymbolthiE 
ID5731714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1190190 
End bp1191221 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content39% 
IMG OID641285690 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_001551204 
Protein GI159903860 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.422887 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTCA TCCCTACTCC TGAAAAACAT GTTGCTCAGC TGATAGATGC AAACCTTGAT 
AGAGCAAGAG AAGGTTTAAG AGTCATTGAA GATTGGTGCA GATATGGCCT GCAAAGAAAA
GACTTGATAA TTATTATTAA GGATTATCGT CATCAATTAG GTCGCCTCCA TCAAAAGACT
TATAAGCAAG CTCGCTCAGC ACAGACTGAC CAAGGTTCAG GATTAACTCA CGAAGCTCAA
AATGATCGTA TAAGTCCTCT ACAAATCGTG AGTGCTAATT GTGCAAGAGT TCAAGAAGCT
CTTCGCGTGA TAGAAGAATT TGCCAGAAAT ATTGATCCAG AGCTTACAAA AGCAGCCTCA
AAAATTCGTT ATGAGATTTA CGATCTAGAA ATCAATATTC AAGAAGCAAC TTCTGGGAAA
AAACGTCAAA AGGAACTCTC AGCTTGCAAA TTATGCTTAA TAACGACCCC CCACCAAGAG
CTTATAGCAA AGGTTTCTGC AGGATTAAAG GCAGGAGTAG GGATGGTGCA ATATCGCTGT
AAAAAGGGTA AAGACATTGA CAAGTTTTCT GAAGCTGAAA AACTTGCTGT GATTTGCAAA
GATTATGGTG CTCTCTTTAT AGTCAATGAC CGAATCGATA TAGCTCTAGC CGTAGATGCT
GACGGTATTC ATATTGGTCA AGAAGACTTA CCACTAGATA TAGCAAGAAA GCTTATAGGT
CCCGAAAAGT TAATAGGAGT TAGTTGCCAT TCTCTTGAAG AAGCTCAAAA GGCAGATAAA
AATGGTTCTG ACTATATAGG CTTTGGTCCT ATTTTTCGAA CTACTTCAAA GCCAGAGGTT
GCTCCACTTG GTCTTGAGTG CTTAAAGCAA ATATCAAATT CAATTAATCA ACCTTGTTTT
GCAATTGGTG GAATCAACCA TCTCAACAGA TCTAAATTGT TGTCAACTGG AGTCTCTCGA
ATAGCAGTCA TTGATGCCAT CATGAAAGCA GAAGACCCTT TTCAAGCAAG CAAGCAGCTA
CTTGAGATTT GA
 
Protein sequence
MAVIPTPEKH VAQLIDANLD RAREGLRVIE DWCRYGLQRK DLIIIIKDYR HQLGRLHQKT 
YKQARSAQTD QGSGLTHEAQ NDRISPLQIV SANCARVQEA LRVIEEFARN IDPELTKAAS
KIRYEIYDLE INIQEATSGK KRQKELSACK LCLITTPHQE LIAKVSAGLK AGVGMVQYRC
KKGKDIDKFS EAEKLAVICK DYGALFIVND RIDIALAVDA DGIHIGQEDL PLDIARKLIG
PEKLIGVSCH SLEEAQKADK NGSDYIGFGP IFRTTSKPEV APLGLECLKQ ISNSINQPCF
AIGGINHLNR SKLLSTGVSR IAVIDAIMKA EDPFQASKQL LEI