Gene SeD_A4009 details


Gene Information

Locus tag: SeD_A4009
Symbol: dppB
ID: 6875034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3853021
End bp: 3854040
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 57%
IMG OID: 642786963
Product: dipeptide transporter permease DppB
Protein accession: YP_002217591
Protein GI: 198244490
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
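
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is only an illustrative check, assuming that the start and end positions are both inclusive and that the CDS is complete and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 3853021
END_BP = 3854040


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # 1020 bp; 339 aa
```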


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCAGT TCATTCTCCG ACGTTTGGGA CTCGTTATCC CGACGTTTAT CGGTATTACC 
CTTCTCACCT TTGCCTTTGT CCACATGATC CCCGGCGATC CGGTGATGAT CATGGCGGGT
GAGCGAGGTA TTTCCCCTGA GCGTCATGCT CAACTGCTGG CTGAACTCGG TCTTGATAAA
CCGATGTGGC AGCAGTACCT CCATTATATC TGGGGGGTGA TGCATGGCGA TTTAGGCATC
TCGCTGAAAA GCCGAATCCC CGTGTGGGAC GAGTTCGTGC CTCGCTTTAA AGCGACGCTG
GAGCTTGGCG TCTGCGCCAT GATTTTTGCC GTCGCCGTGG GGATTCCGGT GGGCGTGCTG
GCCGCAGTGA AGCGCGGTTC TATCTTCGAT CACACTGCCG TTGGCCTGGC GCTGACCGGT
TACTCTATGC CTATCTTCTG GTGGGGCATG ATGCTGATCA TGCTGGTTTC CGTCCACTGG
AACCTGACGC CGGTTTCCGG GCGCGTGAGC GATATGGTGT TCCTTGATGA TACCAATCCG
TTGACGGGCT TTATGCTGAT CGACACCGCT ATCTGGGGCG AAGAGGGTAA CTTTATTGAT
GCGCTGGCGC ATATGATCCT GCCTGCGATG GTGCTCGGCA CAATCCCGCT GGCCGTCATT
GTGCGTATGA CCCGTTCGTC GATGCTGGAA GTGCTGGGGG AGGATTACAT CCGTACCGCA
CGCGCCAAAG GGTTGACCAG GATGCGCGTC ATTATCGTCC ATGCTCTGCG TAACGCTATG
CTGCCAGTCG TCACCGTGAT CGGCCTGCAG GTCGGGACGC TGTTGGCGGG CGCGATTCTG
ACAGAAACTA TCTTCTCGTG GCCCGGTCTG GGGCGCTGGC TGATCGATGC GCTGCAACGC
CGCGATTATC CGGTAGTGCA GGGCGGCGTG TTACTGGTAG CGACGATGAT TATTCTCGTC
AACCTGCTGG TAGACCTGCT GTACGGCGTG GTGAACCCGC GTATTCGGCA TAAGAAGTAA
 
Protein sequence
MLQFILRRLG LVIPTFIGIT LLTFAFVHMI PGDPVMIMAG ERGISPERHA QLLAELGLDK 
PMWQQYLHYI WGVMHGDLGI SLKSRIPVWD EFVPRFKATL ELGVCAMIFA VAVGIPVGVL
AAVKRGSIFD HTAVGLALTG YSMPIFWWGM MLIMLVSVHW NLTPVSGRVS DMVFLDDTNP
LTGFMLIDTA IWGEEGNFID ALAHMILPAM VLGTIPLAVI VRMTRSSMLE VLGEDYIRTA
RAKGLTRMRV IIVHALRNAM LPVVTVIGLQ VGTLLAGAIL TETIFSWPGL GRWLIDALQR
RDYPVVQGGV LLVATMIILV NLLVDLLYGV VNPRIRHKK
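
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the pipeline that produced this record. It assumes an uppercase A/C/G/T sequence read in frame from its first base, and it uses the codon-to-residue mapping that NCBI translation table 11 shares with the standard table. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# This codon-to-residue mapping is shared by NCBI translation tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence above (60 bases each, so both are in frame).
first_line = "ATGTTGCAGT TCATTCTCCG ACGTTTGGGA CTCGTTATCC CGACGTTTAT CGGTATTACC"
last_line = "AACCTGCTGG TAGACCTGCT GTACGGCGTG GTGAACCCGC GTATTCGGCA TAAGAAGTAA"
print(translate(first_line))  # MLQFILRRLGLVIPTFIGIT
print(translate(last_line))   # NLLVDLLYGVVNPRIRHKK
```

The two printed fragments match the first 20 and last 19 residues of the protein sequence listed above; the final TAA codon is the stop and contributes no residue.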