Gene EcSMS35_2700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2700 
Symbol 
ID6144918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2772184 
End bp2773695 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content56% 
IMG OID641617571 
Productputative sugar ABC transporter, ATP-binding protein 
Protein accessionYP_001744736 
Protein GI170683588 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCACGG CAACAGAGGC AGTCCCGGTA GCAAAAGTGG TGGCAGGAAA TAAGCGTTAT 
CCCGGCGTCG TTGCGTTGGA TAACGTTAAC TTCACGCTCA ATAAAGGCGA AGTTCGTGCG
CTGTTAGGCA AAAACGGCGC GGGCAAATCG ACCCTCATTC GAATGCTTAC CGGTAGCGAA
CGTCCGGATA GCGGTGATAT CTGGATTGGC GAGACGCGAC TGGAAGGTGA CGAAACTACG
CTGACTCGCC GTGCCGCTGA ACTGGGGGTT CGTGCGGTTT ATCAGGAATT AAGTCTGGTG
GAAGGGCTGA CGGTGGCGGA AAACCTCTGC CTCGGTCAGT GGCCCCGCCG CAACGGCATG
ATTGATTACC TGCAAATGGC GCAGGATGCC CAACGTTGCT TACAGGCGCT GGGCGTTGAC
GTTAGCCCTG AACAACTTGT TTCAACGCTA AGCCCGGCGC AAAAGCAACT GGTGGAAATT
GCGCGGGTGA TGAAGGGCGA GCCGCGCGTG GTCATTCTTG ATGAACCCAC CAGCTCGCTT
GCCAGTGCGG AAGTTGAACT GGTGATCAGC GCGGTGAAAA AGATGTCGGC ACTGGGCGTG
GCGGTGATTT ATGTCAGCCA CCGGATGGAA GAAATTCGCC GCATTGCCTC CTGTGCCACC
GTTATGCGCG ATGGTCAGGT GGCGGGCGAT GTGATGCTCG AAAACACCTC TACGCATCAT
ATTGTGTCGC TAATGCTCGG GTGCGATCAC GTTGATATTG CGCCGGTTGC CCCTCAGGAA
ATTATGGATC AGGCCGTGCT GGAAGTCCGT GCGTTACGCC ATAAGCCCAA GCTGGAGGAT
ATCAGCTTTA CGCTACGTCG CGGCGAAGTG CTCGGCATTG CTGGCCTGCT GGGGGCAGGG
CGCAGTGAAT TGTTGAAAGC CATAGTTGGG CTGGAGACGT ATGAACAGGG CGAAATTGTT
ATCAACGGCG AGAAAATCAC GTGCCCCGAT TACGGCGACA TGCTGAAACG CGGCATTGGA
TATACGCCAG AAAACCGCAA AGAAGCGGGG ATCATTCCCT GGCTGGGCGT TGACGAAAAT
ACAGTGCTGA CCAATCGGCA AAAAATCAGC GCCAACGGTG TGCTGCAATG GTCCACCATC
CGCCGCCTGA CCGAAGAGGT GATGCAGCGG ATGACGGTCA AGGCCGCCAG TAGCGAAACG
CCCATCGGCA CGCTTTCTGG CGGCAATCAG CAAAAAGTGG TGATCGGTCG TTGGGTCTAT
GCCGCCAGCC AGATTTTGTT GCTCGACGAG CCAACGCGTG GCGTCGATAT CGAAGCCAAA
CAGCAGATTT ACCGTATTGT CCGCGAGCTG GCTGCCGAAG GAAAAAGCGT GGTGTTTATC
TCCAGTGAAG TGGAGGAGTT ACCGTTGGTG TGCGACCGCA TTCTGTTGTT ACAGCACGGT
ACGTTCTCGC AGGAGTTTCA CTCTCCGGTC AATGTGGATG AGCTGATGTC CGCCATTCTG
TCTGTGCACT GA
 
Protein sequence
MFTATEAVPV AKVVAGNKRY PGVVALDNVN FTLNKGEVRA LLGKNGAGKS TLIRMLTGSE 
RPDSGDIWIG ETRLEGDETT LTRRAAELGV RAVYQELSLV EGLTVAENLC LGQWPRRNGM
IDYLQMAQDA QRCLQALGVD VSPEQLVSTL SPAQKQLVEI ARVMKGEPRV VILDEPTSSL
ASAEVELVIS AVKKMSALGV AVIYVSHRME EIRRIASCAT VMRDGQVAGD VMLENTSTHH
IVSLMLGCDH VDIAPVAPQE IMDQAVLEVR ALRHKPKLED ISFTLRRGEV LGIAGLLGAG
RSELLKAIVG LETYEQGEIV INGEKITCPD YGDMLKRGIG YTPENRKEAG IIPWLGVDEN
TVLTNRQKIS ANGVLQWSTI RRLTEEVMQR MTVKAASSET PIGTLSGGNQ QKVVIGRWVY
AASQILLLDE PTRGVDIEAK QQIYRIVREL AAEGKSVVFI SSEVEELPLV CDRILLLQHG
TFSQEFHSPV NVDELMSAIL SVH