Gene A9601_14731 details

Gene Information

Locus tag: A9601_14731
Symbol: thiE
ID: 4718194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1256992
End bp: 1258047
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 30%
IMG OID: 640079194
Product: thiamine-phosphate pyrophosphorylase
Protein accession: YP_001009863
Protein GI: 123969005
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0352] Thiamine monophosphate synthase
TIGRFAM ID: [TIGR00693] thiamine-phosphate pyrophosphorylase
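
The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1256992, 1258047)
print(length_bp)                  # 1056, matches "Gene Length"
print(protein_length(length_bp))  # 351, matches "Protein Length"
```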


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.161501
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAATT CCAATACTAA AGACCATGAA GATTTAAGAA TTTATCAGAT TATTGACGCT 
AATCTAGATA GAGCAAGAGA AGGACTAAGA GTATTAGAGG ATTGGGCTAG ATTTGGTCTA
GGCAAAGAAA AATATGTTGA AAAAATTAAA AATTTTAGAC AAATTTTAGG AAAAAATCAT
TTAGAAGTTT ATAAACAATC TAGAAATCAG ATCGAAGACA ATTGTAAAGG ATTGACTCAT
CAAGAGCAAT TAAACAGAAA AACTTCTGAG CAAATTATAA GTTCTAATTC AGCAAGAGTT
CAAGAAGCAT TACGAGTCAT AGAGGAATTC TCAAGGCTGC ACAATAATGA GCTTTCAAAA
ATCGCTTCTG AAATTAGATA TGAAATTTAT ACTGTTGAAA TTGACCTATT AAGTTTTAGC
AAGTTTAAGA AGTCGGAGAA AATATTAAAA GAAAATGACT TATATGTAAT CACAGATCAA
AAGGACAATT TATTAGAAAT AATTGAAGAG ATTTTAATTG CGGGAGTAAA AATTATTCAG
CATAGATTTA AAACGGGAAC TGATCAAGAT CATCTTCAAG AAGCAATTGA GATTAAAAAT
CTATGTAAAA GATATAATTC TTTGTTCATA GTTAACGATA GACTTGATAT AGCTCTAGCA
TCTAACGCGG ATGGGATTCA TCTTGGACAA GACGATTTAG ACTTAAAAAC CACAAGAAAG
CTATTTGGAT ATTCAAAAAT AATCGGTATA AGTGCAAATA ATGCAATTGA TATTTCAAAT
GCTCTTGACG AGGGTTGTGA CTACATAGGG ATAGGGCCAG TATTTGAAAC TACGACAAAA
AAGAATAAAA AACCTTTAGG TATTGAAAAT ATCAAAACAT TAACAAAGGA TTTAAATATT
CCTTGGTTTG CTATCGGAGG AATCAAGTCA AATAACATTT CATATTTAAA AAGAAATGGG
TTTAAAAAAG TTGCCTTAGT TTCGGAATTA ATGAATTCTG AAGATCCTAA AGAAGACGCT
ATGATGATTT TAAAAGAATT GTCTCATGAA AATTAG
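
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any length or composition check. The following is a minimal sketch of that cleaning step and of a GC-fraction calculation of the kind that could be compared with the reported GC content. The helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as shown in this record.
first_line = "ATGCTGAATT CCAATACTAA AGACCATGAA GATTTAAGAA TTTATCAGAT TATTGACGCT"
print(len(clean_sequence(first_line)))   # 60 bases (6 blocks of 10)
print(f"{gc_fraction(first_line):.0%}")  # GC share of this line only
```

Passing the full 18-line sequence instead of `first_line` gives the value for the whole gene.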
 
Protein sequence
MLNSNTKDHE DLRIYQIIDA NLDRAREGLR VLEDWARFGL GKEKYVEKIK NFRQILGKNH 
LEVYKQSRNQ IEDNCKGLTH QEQLNRKTSE QIISSNSARV QEALRVIEEF SRLHNNELSK
IASEIRYEIY TVEIDLLSFS KFKKSEKILK ENDLYVITDQ KDNLLEIIEE ILIAGVKIIQ
HRFKTGTDQD HLQEAIEIKN LCKRYNSLFI VNDRLDIALA SNADGIHLGQ DDLDLKTTRK
LFGYSKIIGI SANNAIDISN ALDEGCDYIG IGPVFETTTK KNKKPLGIEN IKTLTKDLNI
PWFAIGGIKS NNISYLKRNG FKKVALVSEL MNSEDPKEDA MMILKELSHE N
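
The protein sequence corresponds to a translation of the gene sequence under translation table 11, which can be illustrated with a small sketch. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the permitted start codons. This sketch does not handle alternative start codons, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; ambiguous bases (e.g. N) raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCTGAATT CCAATACTAA AGACCATGAA"))  # MLNSNTKDHE
```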