Gene EcSMS35_4706 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4706 
Symbol 
ID6143827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4805756 
End bp4807258 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content56% 
IMG OID641619522 
Productputative sugar ABC transporter, ATP-binding protein 
Protein accessionYP_001746630 
Protein GI170681344 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.706122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACTA ACCAACACCA GGAAATCCTC CGCACCGAAG GATTAAGTAA ATTTTTCCCC 
GGCGTCAAAG CGTTAGACAA CGTTGATTTC AGCCTGCGTC GCGGCGAAAT TATGGCGCTG
CTCGGTGAAA ACGGGGCGGG AAAATCAACG CTAATCAAAG CCTTAACCGG TGTCTACCAC
GCCGATCGTG GCACCATCTG GCTGGAAGGC CAGACTATCT CACCGAAAAA CACCGCCCAC
GCACAACAAC TCGGCATCGG CACCGTCTAT CAGGAAGTCA ACCTGCTACC CAATATGTCG
GTCGCTGATA ATCTATTCAT AGGTCGCGAA CCCAAACGTT TCGGCCTTCT ACGCCGTAAA
GAGATGGAAA AGCGCGCCAC CGAACTGATG GCATCTTACG GTTTCTCCCT CGACGTGCGC
GAACCGCTCA ACCGCTTTTC AGTCGCGATG CAGCAAATCG TCGCCATTTG CCGGGCCATC
GATCTCTCCG CCAAAGTGCT GATCCTCGAT GAACCCACCG CCAGCCTCGA CACTCAGGAA
GTAGAGTTAC TGTTTGGCCT GATGCGTCAG TTGCGCGATC GCGGCGTCAG CCTGATCTTC
GTTACTCACT TTCTCGATCA GGTCTATCAG GTCAGCGATC GGATCACCGT CTTACGCAAC
GGCAGTTTCG TAGGCTGTCG GGAAACCCGC GAGCTACCAC AGATCGAACT GGTAAAAATG
ATGCTGGGGC GCGAGCTGGA CACCCACGCG CTACAGCGTG CCGGACGAAC ATTGTTGAGC
GACAAACCCG TCGCCGCGTT CAAAAATTAC GGCAAAAAAG GAATGATCGC ACCGTTTGAT
CTCGAAGTGC GCCCCGGCGA GATCGTCGGT CTGGCAGGCT TGCTGGGATC AGGACGTACC
GAAACCGCCG AAGTGATCTT CGGTATTAAA CCTGCTGACA GCGGCACGGC GTTGATCAAA
GGCAAACCGC AAACCCTGCG ATCGCCACAT CAGGCTTCGG TACTTGGCAT TGGCTTTTGC
CCGGAAGACA GGAAAACCGA TGGCATCATC GCCGCCGCCT CGGTGCGGGA AAATATCATT
CTCGCGCTAC AAGCCCAGCG CGGCTGGCTG CGACCGATCT CCCGCAAAGA ACAGCAAGAG
ATTGCCGAAC GCTTTATCCG CCAGCTTGGC ATTCGCACCC CTTCCACTGA ACAACCGATT
GAATTTCTCT CCGGCGGCAA TCAGCAAAAA GTGCTGCTCT CTCGCTGGCT GCTGACTCGT
CCGCAATTTC TGATCCTCGA CGAGCCAACG CGTGGAATTG ACGTTGGTGC GCACGCCGAG
ATCATCCGCC TGATTGAAAC GCTATGCGCT GACGGTCTGG CGCTGCTGGT GATCTCCTCT
GAACTGGAAG AGCTGGTGGG CTATGCCGAT CGGGTGATTA TCATGCGCGA TCGCAAACAG
GTGGCGGAGA TCCCGCTGGC AGCGCTTTCC GTTCCGGCGA TCATGAACGC CATTGCGGCG
TAA
 
Protein sequence
MNTNQHQEIL RTEGLSKFFP GVKALDNVDF SLRRGEIMAL LGENGAGKST LIKALTGVYH 
ADRGTIWLEG QTISPKNTAH AQQLGIGTVY QEVNLLPNMS VADNLFIGRE PKRFGLLRRK
EMEKRATELM ASYGFSLDVR EPLNRFSVAM QQIVAICRAI DLSAKVLILD EPTASLDTQE
VELLFGLMRQ LRDRGVSLIF VTHFLDQVYQ VSDRITVLRN GSFVGCRETR ELPQIELVKM
MLGRELDTHA LQRAGRTLLS DKPVAAFKNY GKKGMIAPFD LEVRPGEIVG LAGLLGSGRT
ETAEVIFGIK PADSGTALIK GKPQTLRSPH QASVLGIGFC PEDRKTDGII AAASVRENII
LALQAQRGWL RPISRKEQQE IAERFIRQLG IRTPSTEQPI EFLSGGNQQK VLLSRWLLTR
PQFLILDEPT RGIDVGAHAE IIRLIETLCA DGLALLVISS ELEELVGYAD RVIIMRDRKQ
VAEIPLAALS VPAIMNAIAA