Gene EcSMS35_3732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3732 
SymbolugpB 
ID6146056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3801431 
End bp3802747 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content53% 
IMG OID641618558 
Productglycerol-3-phosphate transporter periplasmic binding protein 
Protein accessionYP_001745698 
Protein GI170680964 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCGT TACGTTATAC AGCTTCAGCA CTGGCGCTCG GACTGGCTTT AATGGCAAAT 
GCGCAGGCAG CGACGACCAT TCCGTTCTGG CATTCTATGG AAGGGGAACT GGGTAAAGAG
GTGGATTCTC TGGCCCAACG TTTTAACGCC GAAAACCCGG ATTACAAAAT TGTACCGACC
TATAAAGGCA ACTACGAACA GAATTTAAGC GCGGGGATTG CCGCATTTCG TACCGGCAAC
GCTCCGGCTA TTTTGCAGGT TTATGAAGTT GGCACCGCCA CCATGATGGC GTCGAAAGCC
ATTAAACCGG TATATGACGT GTTTAAAGAG GCGGGGATTC AGTTCGATGA GTCGCAGTTT
GTGCCGACGG TTTCAGGCTA CTACTCCGAC AGCAAAACTG GGCACTTACT CTCCCAGCCG
TTCAACAGCT CGACTCCCGT TCTCTATTAC AACAAAGACG CCTTCAAGAA AGCCGGTTTA
GACCCGGAAC AGCCGCCGAA AACCTGGCAG GATCTGGCGG ACTATGCCGC GAAACTGAAA
GCCTCCGGTA TGAAGTGCGG CTACGCCAGC GGCTGGCAGG GCTGGATCCA ACTGGAAAAC
TTTAGCGCCT GGAACGGTCT GCCGTTTGCC AGCAAAAACA ACGGCTTTGA CGGCACAGAC
GCGGTGCTGG AGTTCAACAA GCCGGAGCAG GTGAAACACA TCGCTATGCT CGAAGAGATG
AACAAGAAGG GCGATTTCAG CTACGTCGGG CGTAAGGATG AATCCACCGA GAAGTTCTAT
AACGGTGATT GCGCGATGAC GACCGCCTCT TCCGGTTCTC TTGCCAACAT TCGCGAGTAC
GCCAAATTTA ACTATGGCGT AGGCATGATG CCTTACGATG CCGATGCGAA AGACGCGCCG
CAAAACGCCA TTATCGGCGG AGCCAGTCTA TGGGTAATGC AGGGTAAAGA TAAAGAAACC
TACACCGGCG TGGCGAAGTT CCTCGACTTC CTCGCAAAGC CAGAAAACGC TGCCGAGTGG
CATCAGAAAA CCGGCTATCT GCCAATCACT AAAGCGGCGT ATGACCTGAC CCGTGAGCAG
GGCTTTTACG AGAAAAACCC AGGAGCGGAT ATTGCCACGC GTCAGATGCT GAACAAGCCA
CCGTTGCCGT TCACCAAAGG TTTGCGTCTG GGCAACATGC CGCAGATCCG CGTGATTGTG
GATGAAGAGC TGGAGAGCGT GTGGACCGGT AAGAAGACAC CACAGCAGGC GCTGGATACT
GCCGTTGAGC GTGGGAACCA GTTACTGCGC CGCTTTGAGA AATCGACGAA GTCTTAA
 
Protein sequence
MKPLRYTASA LALGLALMAN AQAATTIPFW HSMEGELGKE VDSLAQRFNA ENPDYKIVPT 
YKGNYEQNLS AGIAAFRTGN APAILQVYEV GTATMMASKA IKPVYDVFKE AGIQFDESQF
VPTVSGYYSD SKTGHLLSQP FNSSTPVLYY NKDAFKKAGL DPEQPPKTWQ DLADYAAKLK
ASGMKCGYAS GWQGWIQLEN FSAWNGLPFA SKNNGFDGTD AVLEFNKPEQ VKHIAMLEEM
NKKGDFSYVG RKDESTEKFY NGDCAMTTAS SGSLANIREY AKFNYGVGMM PYDADAKDAP
QNAIIGGASL WVMQGKDKET YTGVAKFLDF LAKPENAAEW HQKTGYLPIT KAAYDLTREQ
GFYEKNPGAD IATRQMLNKP PLPFTKGLRL GNMPQIRVIV DEELESVWTG KKTPQQALDT
AVERGNQLLR RFEKSTKS