Gene EcSMS35_1659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1659 
SymbollsrA 
ID6145712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1646597 
End bp1648132 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content53% 
IMG OID641616535 
Productautoinducer-2 ABC transporter, ATP-binding protein LsrA 
Protein accessionYP_001743713 
Protein GI170679722 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.935771 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACGA GTGATACCCG CGCGTTACCG CTACTTTGCG CCCGCTCGGT TTATAAACAG 
TATTCAGGAG TTAATGTCCT GAAAGGCATC GATTTTACGT TGCATCAGGG GGAGGTCCAC
GCCCTGCTCG GCGGCAATGG TGCCGGTAAA TCGACGTTAA TGAAGATTAT TGCCGGTATT
ACCCCTGCTG ATAGCGGTAC GCTGGAGATT GGGGGCAACA ACTACGTCAG ATTAACGCCA
GTTCATGCTC ATCAGCTGGG CATTTATCTC GTTCCCCAGG AACCGCTGCT TTTTCCAAGC
CTGTCGATAA AAGAAAACAT CCTGTTTGGT CTGGCAAAAA AACAGCTCTC CATACAGAAA
ATGAATAACT TGCTGGCGGC GCTGGGCTGC CAGTTTGATC TGCATAGTCT GGCAGGATCG
CTGGATGTCG CCGATCGCCA AATGGTGGAA ATCCTCCGCG GGCTGATGCG CGATTCGCGA
ATTCTGATCC TCGATGAACC TACCGCCTCG CTTACTCCCG CCGAAACCGA ACGTTTGTTT
ATTCGCTTGC GGGAGCTGCT TGCTACTGGC GTGGGTATTG TTTTTATCTC GCATAAGCTG
CCGGAAATTC GCCAGATTGC CGACCGAATT AGCGTGATGC GCGACGGAAC CATCGCCTTA
AGCGGCAAAA CCAGCGAACT GTCTACCGAC GACATTATTC AGGCCATTAC GCCAGCGCTA
CGGGAAAAAT CGCTCTCTGC CAGCCAAAAA TTATGGCTGG AATTACCTGG CAACCGCCCA
CAACATGCCG CGGGAACGCC GGTGCTGACA CTGGAAAATC TGACCGGCGA AGGTTTCAGG
AATGTCAGCC TGACGCTCAA TGCCGGAGAA ATTCTGGGCC TGGCTGGGCT GGTGGGAGCC
GGACGCACAG AACTGGCCGA GACGCTCTAT GGTCTGCGTA CTTTGCGTGG CGGACGCATT
ATGCTGAATG GTAAAGAGAT CAATAGATTA TCCACCGGAG AACGTTTACT GCGCGGTCTG
GTTTATCTGC CGGAAGATCG CCAGTCATCC GGACTTAATC TCGATGCTTC ACTGGCGTGG
AACGTCTGCG CCCTTACTCA TAACCTTCGT GGATTCTGGG CGAAAACCGC GAAAGATAAT
GCCACCCTGG AACGATATCG TCGGGCGCTG AATATTAAAT TTAACCAACC GGAACAAGCT
GCGCGGACCT TATCCGGCGG CAACCAGCAA AAAATCCTGA TTGCCAAATG CCTGGAAGCC
TCGCCGCAAG TATTGATTGT CGATGAGCCG ACGCGCGGCG TGGATGTCTC GGCGCGTAAT
GATATCTACC AACTGTTGCG CAGCATCGCC GCACAGAATG TGGCTGTGCT GCTTATCTCC
TCTGATCTCG AAGAGATCGA ACTGATGGCA GATCGCGTGT ATGTGATGCA TCAGGGCGAA
ATTGCCCACT CTGCACTGAC CGGGCGCGAT ATTAATGTCG AGACCATTAT GCGCGTTGCC
TTCGGCGATA GTCAGCGTCA GGAGGCGTCA TGCTGA
 
Protein sequence
MQTSDTRALP LLCARSVYKQ YSGVNVLKGI DFTLHQGEVH ALLGGNGAGK STLMKIIAGI 
TPADSGTLEI GGNNYVRLTP VHAHQLGIYL VPQEPLLFPS LSIKENILFG LAKKQLSIQK
MNNLLAALGC QFDLHSLAGS LDVADRQMVE ILRGLMRDSR ILILDEPTAS LTPAETERLF
IRLRELLATG VGIVFISHKL PEIRQIADRI SVMRDGTIAL SGKTSELSTD DIIQAITPAL
REKSLSASQK LWLELPGNRP QHAAGTPVLT LENLTGEGFR NVSLTLNAGE ILGLAGLVGA
GRTELAETLY GLRTLRGGRI MLNGKEINRL STGERLLRGL VYLPEDRQSS GLNLDASLAW
NVCALTHNLR GFWAKTAKDN ATLERYRRAL NIKFNQPEQA ARTLSGGNQQ KILIAKCLEA
SPQVLIVDEP TRGVDVSARN DIYQLLRSIA AQNVAVLLIS SDLEEIELMA DRVYVMHQGE
IAHSALTGRD INVETIMRVA FGDSQRQEAS C