Gene EcSMS35_4031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4031 
SymboluhpT 
ID6144251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4119215 
End bp4120606 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content53% 
IMG OID641618856 
Productsugar phosphate antiporter 
Protein accessionYP_001745994 
Protein GI170680555 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00881] phosphoglycerate transporter family protein 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.505576 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCTT TCTTAAACCA GGTTCGCAAG CCGACCCTGG ACCTTCCGCT CGAAGTGCGG 
CGCAAAATGT GGTTCAAACC GTTCATGCAA TCCTACCTGG TGGTCTTTAT CGGCTACCTG
ACGATGTACC TGATTCGCAA GAACTTTAAC ATCGCGCAGA ACGATATGAT TTCGACCTAC
GGGTTGAGCA TGACGCAGCT GGGGATGATC GGCCTGGGTT TCTCCATCAC TTATGGCGTG
GGTAAAACGC TGGTTTCCTA CTACGCCGAC GGCAAAAACA CCAAACAATT CCTGCCGTTC
ATGCTGATCC TGTCTGCCAT TTGTATGCTG GGCTTCAGCG CCAGTATGGG CAGCGGCTCG
GTTAGCCTGT TCCTGATGAT TGCCTTCTAC GCCTTAAGCG GCTTTTTCCA GAGCACCGGC
GGTTCGTGCA GTTACTCCAC CATCACCAAA TGGACGCCGC GTCGTAAGCG CGGGACCTTC
CTCGGTTTCT GGAACATTTC TCACAACCTT GGCGGTGCAG GCGCAGCAGG TGTGGCGCTG
TTCGGCGCAA ATTACCTGTT CGATGGTCAT GTTATCGGCA TGTTTATCTT CCCGTCGATT
ATCGCGCTGA TTGTTGGTTT TATCGGCCTG CGTTACGGCA GCGACTCCCC GGAATCTTAT
GGCCTCGGCA AAGCTGAAGA GCTGTTCGGC GAGGAGATCA GTGAAGAGGA TAAAGAGACA
GAATCTACCG ATATGACCAA GTGGCAGATC TTTGTTGAGT ATGTGCTGAA AAACAAAGTG
ATCTGGCTGC TGTGCTTTGC CAACATTTTC CTCTATGTGG TGCGTATTGG TATCGACCAG
TGGTCAACCG TGTATGCGTT CCAGGAACTG AAACTCTCTA AAGCGGTGGC GATTCAGGGC
TTTACGCTGT TTGAAGCTGG TGCGCTGGTC GGTACGCTGC TGTGGGGCTG GCTCTCTGAC
CTGGCGAACG GTCGCCGTGG CCTGGTGGCC TGTATCGCGC TGGCGCTGAT TATCGCCACG
CTCGGTGTGT ATCAACACGC CAGCAACCAA TATATCTACC TGGCTTCTCT CTTTGCGTTG
GGTTTCCTGG TCTTTGGCCC GCAATTGTTG ATTGGTGTAG CTGCTGTTGG CTTTGTACCT
AAAAAAGCGA TTGGCGCTGC CGATGGTATT AAAGGCACCT TCGCTTACCT GATTGGTGAC
AGCTTTGCCA AGTTAGGTCT GGGAATGATT GCTGATGGGA CGCCGGTATT TGGCCTTACC
GGCTGGGCAG GCACCTTCGC CGCGCTGGAT ATCGCCGCGA TTGGTTGTAT CTGCCTGATG
GCGATAGTGG CGGTGATGGA AGAACGCAAA ATCCGCCGCG AGAAAAAAAT TCAGCAGTTG
ACAGTGGCAT AA
 
Protein sequence
MLAFLNQVRK PTLDLPLEVR RKMWFKPFMQ SYLVVFIGYL TMYLIRKNFN IAQNDMISTY 
GLSMTQLGMI GLGFSITYGV GKTLVSYYAD GKNTKQFLPF MLILSAICML GFSASMGSGS
VSLFLMIAFY ALSGFFQSTG GSCSYSTITK WTPRRKRGTF LGFWNISHNL GGAGAAGVAL
FGANYLFDGH VIGMFIFPSI IALIVGFIGL RYGSDSPESY GLGKAEELFG EEISEEDKET
ESTDMTKWQI FVEYVLKNKV IWLLCFANIF LYVVRIGIDQ WSTVYAFQEL KLSKAVAIQG
FTLFEAGALV GTLLWGWLSD LANGRRGLVA CIALALIIAT LGVYQHASNQ YIYLASLFAL
GFLVFGPQLL IGVAAVGFVP KKAIGAADGI KGTFAYLIGD SFAKLGLGMI ADGTPVFGLT
GWAGTFAALD IAAIGCICLM AIVAVMEERK IRREKKIQQL TVA