Gene EcSMS35_4705 details

Gene Information

Locus tag: EcSMS35_4705
Symbol:
ID: 6144410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4804667
End bp: 4805623
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 50%
IMG OID: 641619521
Product: putative sugar ABC transporter, periplasmic sugar-binding protein
Protein accession: YP_001746629
Protein GI: 170679669
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
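
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the record does not state either convention, but both are consistent with the listed values. Variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4804667
end_bp = 4805623

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 957
print(protein_length)  # 318
```

Both results agree with the Gene Length (957 bp) and Protein Length (318 aa) fields.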


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGAAAC GCTTACTTGT AGTCTCTGCA GTCTCGGCAG CCATGTCGTC TATGGCGTTG 
GCCGCTCCAT TAACCGTTGG ATTTTCGCAG GTCGGATCGG AATCCGGCTG GCGCGCCGCA
GAAACCAATG TGGCGAAAAG TGAAGCCGAA AAGCGCGGAA TTACGTTGAA AATTGCCGAT
GGTCAGCAAA AGCAGGAAAA CCAGATTAAA GCGGTACGTT CCTTCGTCGC GCAAGGGGTG
GATGCGATCT TTATCGCTCC AGTGGTAGCG ACCGGTTGGG AGCCGGTATT AAAAGAGGCG
AAAGATGCCG AAATCCCGGT CTTCTTGCTT GACCGTTCCA TCGATGTGAA AGACAAATCT
CTCTATATGA CCACTGTCAC CGCCGACAAC ATCCTCGAAG GCAAGTTGAT TGGTGACTGG
CTGGTAAAAG AAGTGAATGG CAAACCATGC AACGTGGTGG AGTTGCAGGG CACCGTTGGA
GCCAGCGTCG CCATTGACCG TAAGAAAGGC TTTGCCGAAG CCATTAAGAA TGCGCCAAAT
ATCAAAATTA TCCGCTCGCA GTCAGGTGAC TTCACCCGCA GCAAAGGCAA AGAAGTGATG
GAGAGCTTTA TCAAAGCGGA AAACAACGGC AAAAACATCT GCATGGTTTA CGCCCATAAC
GATGACATGG TAATTGGTGC AATTCAGGCA ATTAAAGAAG CGGGCCTGAA ACCGGGCAAA
GATATCCTGA CAGGCTCTAT CGACGGCGTG CCGGACATCT ACAAAGCGAT GATTGATGGC
GAAGCGAACG CCAGCGTTGA ACTAACGCCG AATATGGCAG GCCCTGCTTT TGACGCGCTG
GAGAAATACA AAAAAGACGG CACCATGCCT GAAAAGCTGA CGCTGACCAA GTCCACCCTT
TACCTGCCTG ATACCGCAAA AGAAGAGTTA GAGAAGAAGA AAAATATGGG GTATTGA
 
Protein sequence
MWKRLLVVSA VSAAMSSMAL AAPLTVGFSQ VGSESGWRAA ETNVAKSEAE KRGITLKIAD 
GQQKQENQIK AVRSFVAQGV DAIFIAPVVA TGWEPVLKEA KDAEIPVFLL DRSIDVKDKS
LYMTTVTADN ILEGKLIGDW LVKEVNGKPC NVVELQGTVG ASVAIDRKKG FAEAIKNAPN
IKIIRSQSGD FTRSKGKEVM ESFIKAENNG KNICMVYAHN DDMVIGAIQA IKEAGLKPGK
DILTGSIDGV PDIYKAMIDG EANASVELTP NMAGPAFDAL EKYKKDGTMP EKLTLTKSTL
YLPDTAKEEL EKKKNMGY
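
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the method used by the database. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the set of permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Amino-acid assignments in NCBI order, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to format the record."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGTGGAAAC GCTTACTTGT AGTCTCTGCA"
print(translate(first_blocks))  # MWKRLLVVSA
```

The output matches the first block of the protein sequence. Pasting the full gene sequence into `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the GC content field.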