Gene P9303_19391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_19391 
SymbolthiE 
ID4777528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1706747 
End bp1707808 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content55% 
IMG OID640087449 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_001017946 
Protein GI124023639 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.706748 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCGA TGCCTGTCGC CCCGATCGCA GATCTACGTG TGGCTCAGCT GATCGATGCC 
AACCTCGATC GAGCTCGAGA AGGACTGCGA GTCGTCGAAG ACTGGTGCCG CTTCGGCCTA
GATCGTGAAG ACCTTGTGGT GACTCTCAAA GACTGGCGTC AGCGACTGGG TCGCCATCAT
CACGACAGCT ACAAACAGGC ACGCTCCACT GCTACAGATC AAGGGATCGG CCTCAGTCAT
CCTGCTCAGC AAGAGCGACA CGAACCATGG CATGTTGTGG CAGCCAACTG TGCACGCGTT
CAAGAAGCTC TACGCGTACT GGAAGAGTTC GCCCGTCAGC CAGATCCTCA GCTGGCTGCC
AGCGCTGCTG CAATCCGCTA TGGCCTCTAC GACCTAGAGG TGACCGTGCT GCAGGCCAAC
GCAGGCAAAA AGAGACGCCA ACAACTGCAG GCCTGCCATC TTTGCCTGAT TACGACATCA
CAATCCGATC TAGCCAACAA CGATCTATTC AGAACAGTGA GCGCAGCACT AGTCGCTGGC
ATCGACATGG TGCAATACCG CAATAAAGAA GCTAGCGACT TGCAACGACT GACTCAGGCA
AAAGAGCTGG CCAGCCTATG CAGAAAGCAT GGGGCGCTAT TCATCGTTAA TGACCGAATC
GACTTAGCCC TTGCAGTGGA CGCCGATGGC GTTCATCTCG GCCAGGACGA CCTCCCCACA
GACGTAGCCA GGGGACTGAT CGGCAGCGAA CGACTACTGG GTCGAAGCAC ACAGTTCCTT
GCCCAGCTTC AAAAAGCTGA AGCAGAAGGT TGCGACTATC TAGGAGTAGG GCCTGTCAAC
AGCACAGCCA CAAAACCGGA ACGACAACCA ATTGGGCTTG CCTATGTGAA GGAGGCATCT
AAAGCCACCC AGCTACCTTG GTTTGCCATT GGTGGCATCA ACATCTCAAA CCTAGAAGCA
GTACGTCAAG CCGGAGCAAA GCGAATCGCT GTGATCGGAG CGATCATGAA TTCCAAAGAT
CCTGCCGCTA CCAGCCTTCA ACTACTGGAG GCTCTGAGAT GA
 
Protein sequence
MKSMPVAPIA DLRVAQLIDA NLDRAREGLR VVEDWCRFGL DREDLVVTLK DWRQRLGRHH 
HDSYKQARST ATDQGIGLSH PAQQERHEPW HVVAANCARV QEALRVLEEF ARQPDPQLAA
SAAAIRYGLY DLEVTVLQAN AGKKRRQQLQ ACHLCLITTS QSDLANNDLF RTVSAALVAG
IDMVQYRNKE ASDLQRLTQA KELASLCRKH GALFIVNDRI DLALAVDADG VHLGQDDLPT
DVARGLIGSE RLLGRSTQFL AQLQKAEAEG CDYLGVGPVN STATKPERQP IGLAYVKEAS
KATQLPWFAI GGINISNLEA VRQAGAKRIA VIGAIMNSKD PAATSLQLLE ALR