Gene EcSMS35_1094 details


Gene Information

Locus tag: EcSMS35_1094
Symbol:
ID: 6146749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1109201
End bp: 1110493
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 53%
IMG OID: 641615978
Product: d-galactonate transporter
Protein accession: YP_001743170
Protein GI: 170680666
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID: [TIGR00881] phosphoglycerate transporter family protein
            [TIGR00893] d-galactonate transporter
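
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1109201, 1110493)
    print(length, protein_length(length))
```

Under these assumptions the values from the record (1109201 to 1110493) give 1293 bp and 430 aa, which agrees with the listed Gene Length and Protein Length.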


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATTC CCGTTAATGC AGCAAAGCCG GGGCGTCGGC GTTATCTGAC GCTGGTGATG 
ATCTTTATTA CGGTGGTCAT TTGTTATGTT GACCGCGCTA ACCTGGCCGT GGCTGCCGCC
CATATTCAGG AAGAGTTCGG CATTACCAAA GCGGAAATGG GCTATGTATT TTCGGCCTTC
GCCTGGCTTT ATACGCTATG TCAGATCCCC GGCGGTTGGT TTTTAGATCG CGTAGGTTCA
CGCGTGACTT ATTTTATTGC GATATTTGGC TGGTCAGTGG CGACGTTATT CCAGGGCTTT
GCCACGGGAT TAATGTCATT AATTGGTCTG CGCGCGATAA CCGGTATTTT CGAAGCGCCC
GCTTTCCCGA CCAATAACCG GATGGTGACC AGCTGGTTCC CGGAACATGA ACGCGCTTCT
GCCGTTGGTT TTTATACGTC TGGTCAGTTT GTCGGTCTGG CGTTTCTGAC TCCGCTGCTG
ATCTGGATTC AGGAGATGTT GAGCTGGCAC TGGGTGTTCA TTGTCACCGG TGGTATCGGC
ATTATCTGGT CGCTGATTTG GTTTAAGGTT TATCAGCCGC CGCGCCTGAC CAAAGGCATC
AGCAAAGCTG AACTGGATTA CATTCGTGAT GGCGGCGGTC TGGTGGATGG TGATGCGCCG
GTGAAAAAAG AGGCACGTCA GCCGTTAACA GCCAAAGACT GGAAACTGGT GTTCCATCGT
AAACTGATCG GCGTCTATCT TGGACAATTT GCGGTGACTT CTACACTGTG GTTTTTTTTG
ACCTGGTTCC CGAACTATTT AACCCAGGAA AAAGGAATCA CGGCGCTGAA AGCAGGCTTT
ATGACCTCGG TACCATTCCT CGCGGCGTTT GTCGGCGTCC TGCTCTCTGG CTGGGTCGCG
GATCTGCTGG TACGTAAGGG CTTTTCACTG GGCTTTGCGC GTAAAACGCC GATTATCTGC
GGCTTGCTGA TCTCCACCTG CATTATGGGC GCTAACTACA CTAACGATCC GATGATGATT
ATGTGCCTGA TGGCGCTGGC ATTCTTCGGC AACGGTTTTG CTTCGATTAC CTGGTCGCTG
GTCTCTTCTC TGGCACCGAT GCGCCTGATT GGTTTAACCG GTGGCGTATT TAACTTTGTT
GGCGGTCTGG GCGGCATCAC CGTTCCGCTG GTGGTGGGGT ACCTGGCGCA GGGTTACGGT
TTCGCACCTG CACTGGTTTA TATCTCCGCC GTCGCGTTGA TTGGCGCGCT CTCTTACATT
CTGATGGTGG GCGATGTGAA GCGCGTTGGA TAA
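
The gene sequence is printed in blocks of 10 bases, so it must be stripped of whitespace before it can be measured. The sketch below shows one way to do that and to compute the GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding used by the source database for its "GC content" field is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Only the first printed line of the gene is used here for illustration.
    first_line = "ATGGATATTC CCGTTAATGC AGCAAAGCCG GGGCGTCGGC GTTATCTGAC GCTGGTGATG"
    print(len(clean_sequence(first_line)))
```

Pasting all printed lines of the gene into `clean_sequence` should yield a string whose length can be compared against the listed 1293 bp, and `gc_percent` can be compared against the listed 53%.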
 
Protein sequence
MDIPVNAAKP GRRRYLTLVM IFITVVICYV DRANLAVAAA HIQEEFGITK AEMGYVFSAF 
AWLYTLCQIP GGWFLDRVGS RVTYFIAIFG WSVATLFQGF ATGLMSLIGL RAITGIFEAP
AFPTNNRMVT SWFPEHERAS AVGFYTSGQF VGLAFLTPLL IWIQEMLSWH WVFIVTGGIG
IIWSLIWFKV YQPPRLTKGI SKAELDYIRD GGGLVDGDAP VKKEARQPLT AKDWKLVFHR
KLIGVYLGQF AVTSTLWFFL TWFPNYLTQE KGITALKAGF MTSVPFLAAF VGVLLSGWVA
DLLVRKGFSL GFARKTPIIC GLLISTCIMG ANYTNDPMMI MCLMALAFFG NGFASITWSL
VSSLAPMRLI GLTGGVFNFV GGLGGITVPL VVGYLAQGYG FAPALVYISA VALIGALSYI
LMVGDVKRVG
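
The relationship between the gene and protein sequences can be illustrated with a small translator. This is a sketch only, not the database's own method: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGGATATTC CCGTTAATGC AGCAAAGCCG"))
```

The first 30 bases of the gene translate to `MDIPVNAAKP`, matching the first block of the protein sequence, and the final codons `GTT GGA TAA` give `VG` followed by a stop, matching its last two residues.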