Gene P9515_14351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_14351 
SymbolthiE 
ID4718721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp1272654 
End bp1273715 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content30% 
IMG OID640081122 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_001011749 
Protein GI123966668 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAAC CAAAAATAAA TCAACCGGAA GACTTACGAA TTTCTCAGAT TATTGACGCC 
AATCTAGATA GAGCAAGAGA AGGATTAAGA GTCTTAGAGG ACTGGGCCAG ATTTGGCTTA
GGAAATGAAG ATTTTGTCAT AAGAATAAAA AACCTCCGAC AAATATTAGG TAAAAATCAT
TTAGAAATTT ACAAAAAATC AAGAAATCAT ATAGAAGATC AATGTAAAGG GTTATCTCAT
ATTGAACAAA TCCACCGGAA AAGTCCCTCT AAAATAATAA GTTCTAATTC TGCTAGGGTT
CAAGAGGCTC TTAGAGTTAT TGAAGAGTTT TCAAGAAACC ATAATAATAA ACTTTCCAAA
ATAGCTTCTG ATATTAGATA TGAAATTTAC ACTTTAGAAA TTGAACTATT AAATCTAAAC
ACTCGTAAGA GAGCAGAGTT AATAATTAGA GAAAACAATT TATATTCGAT AACAGATCAT
AGAGACAACT TATTACAAAT AATTGAAAAA ATATTGTTAG GAGGAGTAAA AATTATTCAG
CACAGATTTA AAGAAGGTAA TGATAAAAAT CATCTCAAAG AAGCAATTCA AGTAAAGAAC
CTATGTGAAA AATATAATTC TTTGTTCATC GTTAATGACA GAGTAGATAT AGCAATGGCA
TCAAATGCAG ACGGTGTTCA TCTTGGGCAA GAAGACATTG ATGTAAAAAC AGCAAGAAAA
TTACTAGGCA GTTCTAAAAT CATTGGTGTT TCAGCAAATA ATTCAACTGA TATCAATAAA
GCTATAAAAG ATGGATGCGA TTACATTGGT ATTGGGCCAG TTTTTCAATC CTTAACAAAA
AAGGGAAAAG AACCACTCGG GGTTGAGAAG ATTAAAACTT TAATAAAAGA TATAAACATT
CCTTGTTTTG CTATAGGAGG TATTAACAAA TTAAATATTT CTTGTTTAAA AAGTCATAGA
ATTAGCAAGG TTGCAGTAGT TTCAGGGCTA CTAAATTCAG AAGATCCAAA AGAAGAAGCT
ATTATTATCT TAAAAAAACT TTCCAATGAA AATTATAGTT AA
 
Protein sequence
MEQPKINQPE DLRISQIIDA NLDRAREGLR VLEDWARFGL GNEDFVIRIK NLRQILGKNH 
LEIYKKSRNH IEDQCKGLSH IEQIHRKSPS KIISSNSARV QEALRVIEEF SRNHNNKLSK
IASDIRYEIY TLEIELLNLN TRKRAELIIR ENNLYSITDH RDNLLQIIEK ILLGGVKIIQ
HRFKEGNDKN HLKEAIQVKN LCEKYNSLFI VNDRVDIAMA SNADGVHLGQ EDIDVKTARK
LLGSSKIIGV SANNSTDINK AIKDGCDYIG IGPVFQSLTK KGKEPLGVEK IKTLIKDINI
PCFAIGGINK LNISCLKSHR ISKVAVVSGL LNSEDPKEEA IIILKKLSNE NYS