Gene EcSMS35_4055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4055 
Symbol 
ID6145193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4147203 
End bp4148495 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content53% 
IMG OID641618881 
Productd-galactonate transporter 
Protein accessionYP_001746019 
Protein GI170682764 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00881] phosphoglycerate transporter family protein
[TIGR00893] d-galactonate transporter 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATTC CCGTTAATGC AGCAAAGCCG GGGCGTCGGC GTTATCTGAC GCTGGTGATG 
ATCTTTATTA CGGTGGTCAT TTGTTATGTT GACCGCGCTA ACCTGGCCGT GGCTTCCGCC
CATATTCAGG AAGAGTTCGG TATTACGAAA GCGGAAATGG GCTATGTATT TTCGGCCTTC
GCCTGGCTTT ATACGCTATG TCAGATCCCC GGCGGTTGGT TTTTAGATCG CGTAGGTTCA
CGCGTGACTT ATTTTATTGC GATATTTGGC TGGTCAGTGG CGACGTTATT CCAGGGCTTT
GCCACGGGAT TAATGTCATT AATTGGTCTG CGCGCAATAA CCGGTATTTT CGAAGCGCCC
GCTTTCCCGA CCAATAACCG GATGGTGACC AGCTGGTTCC CGGAACATGA ACGCGCTTCT
GCCGTTGGTT TTTATACGTC TGGTCAGTTT GTCGGTCTGG CGTTTCTGAC TCCGCTGCTG
ATCTGGATTC AGGAGATGTT GAGCTGGCAC TGGGTGTTCA TTGTCACCGG TGGTATCGGC
ATTATCTGGT CGCTGATTTG GTTTAAGGTT TATCAGCCGC CGCGTCTGAC CAAAGGCATC
AGCAAAGCTG AACTGGATTA CATTCGTGAT GGCGGCGGTC TGGTGGATGG CGATGCGCCG
GTGAAGAAAG AGGCGCGTCA GCCGTTAACA GCCAAAGACT GGAAACTGGT GTTCCATCGT
AAACTGATCG GCGTCTATCT TGGGCAATTT GCGGTGGCTT CTACACTGTG GTTTTTCTTA
ACCTGGTTCC CGAACTATTT AACCCAGGAA AAAGGAATCA CGGCGCTGAA AGCAGGCTTT
ATGACCACGG TGCCATTCCT CGCGGCGTTT GTCGGCGTCC TGCTCTCTGG CTGGGTCGCG
GATCTGCTGG TACGTAAGGG CTTTTCACTG GGCTTTGCGC GTAAAACGCC GATTATCTGC
GGCTTGCTGA TCTCCACCTG CATTATGGGC GCTAACTACA CTAACGATCC GATGATGATT
ATGTGCCTGA TGGCGCTGGC ATTCTTCGGC AACGGTTTTG CTTCGATTAC CTGGTCGCTG
GTTTCTTCTC TGGCACCGAT GCGCCTGATT GGTTTAACCG GCGGCGTGTT TAACTTTGCC
GGTGGTTTGG GCGGCATCAC CGTTCCGCTG GTGGTGGGGT ACCTGGCGCA GGGTTACGGT
TTCGCGCCTG CACTGGTTTA TATCTCCGCC GTCGCGTTGA TTGGCGCGCT CTCTTACATC
CTGCTGGTGG GCGATGTGAA GCGCGTTGGC TAA
 
Protein sequence
MDIPVNAAKP GRRRYLTLVM IFITVVICYV DRANLAVASA HIQEEFGITK AEMGYVFSAF 
AWLYTLCQIP GGWFLDRVGS RVTYFIAIFG WSVATLFQGF ATGLMSLIGL RAITGIFEAP
AFPTNNRMVT SWFPEHERAS AVGFYTSGQF VGLAFLTPLL IWIQEMLSWH WVFIVTGGIG
IIWSLIWFKV YQPPRLTKGI SKAELDYIRD GGGLVDGDAP VKKEARQPLT AKDWKLVFHR
KLIGVYLGQF AVASTLWFFL TWFPNYLTQE KGITALKAGF MTTVPFLAAF VGVLLSGWVA
DLLVRKGFSL GFARKTPIIC GLLISTCIMG ANYTNDPMMI MCLMALAFFG NGFASITWSL
VSSLAPMRLI GLTGGVFNFA GGLGGITVPL VVGYLAQGYG FAPALVYISA VALIGALSYI
LLVGDVKRVG