Gene EcSMS35_3862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3862 
SymboldppB 
ID6144257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3931243 
End bp3932262 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content56% 
IMG OID641618688 
Productdipeptide transporter permease DppB 
Protein accessionYP_001745828 
Protein GI170681195 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCAGT TTATTCTCCG ACGTTTGGGA CTCGTCATCC CCACGTTTAT CGGTATTACC 
CTTCTCACAT TTGCCTTTGT CCACATGATC CCGGGCGATC CGGTGATGAT CATGGCGGGC
GAGCGTGGGA TCTCCCCAGA GCGTCACGCG CAATTGCTGG CTGAACTCGG CTTAGATAAA
CCGATGTGGC AGCAGTATCT CCATTACATT TGGGGCGTAA TGCACGGTGA TTTAGGCATT
TCCATGAAAA GCCGAATTCC GGTATGGGAA GAGTTCGTGC CGCGCTTTCA GGCTACGCTG
GAACTTGGCG TCTGCGCGAT GATTTTTGCG ACGGCAGTCG GTATTCCGGT TGGTGTGCTG
GCTGCGGTTA AACGTGGTTC CATTTTCGAT CACACAGCGG TTGGCCTGGC GCTGACCGGC
TACTCGATGC CAATTTTCTG GTGGGGCATG ATGCTGATCA TGCTGGTTTC GGTGCACTGG
AACCTGACGC CTGTCTCCGG TCGTGTGAGC GACATGGTGT TCCTCGATGA TTCCAATCCG
TTAACCGGTT TTATGCTGAT CGACACCGCC ATCTGGGGTG AAGACGGTAA CTTTATCGAT
GCCGTCGCCC ATATGATCCT ACCCGCCATT GTGCTGGGTA CCATCCCGCT GGCGGTCATT
GTGCGTATGA CGCGCTCCTC GATGCTGGAA GTGCTGGGCG AAGATTATAT CCGCACCGCG
CGTGCCAAAG GTCTTACCCG GATGCGGGTG ATTATCGTCC ATGCGCTGCG TAACGCGATG
CTGCCGGTGG TGACCGTTAT CGGCCTGCAG GTGGGAACAT TGCTGGCGGG GGCGATTCTG
ACTGAAACCA TCTTCTCGTG GCCCGGTCTG GGGCGTTGGT TGATTGACGC ACTGCAACGC
CGCGATTATC CGGTAGTGCA GGGCGGCGTA TTGCTGGTGG CGACGATGAT TATCCTCGTC
AACCTGCTGG TCGATTTGCT GTACGGCGTG GTGAACCCGC GTATTCGTCA TAAGAAGTAA
 
Protein sequence
MLQFILRRLG LVIPTFIGIT LLTFAFVHMI PGDPVMIMAG ERGISPERHA QLLAELGLDK 
PMWQQYLHYI WGVMHGDLGI SMKSRIPVWE EFVPRFQATL ELGVCAMIFA TAVGIPVGVL
AAVKRGSIFD HTAVGLALTG YSMPIFWWGM MLIMLVSVHW NLTPVSGRVS DMVFLDDSNP
LTGFMLIDTA IWGEDGNFID AVAHMILPAI VLGTIPLAVI VRMTRSSMLE VLGEDYIRTA
RAKGLTRMRV IIVHALRNAM LPVVTVIGLQ VGTLLAGAIL TETIFSWPGL GRWLIDALQR
RDYPVVQGGV LLVATMIILV NLLVDLLYGV VNPRIRHKK