Gene SeSA_A3828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3828 
SymboldppA 
ID6517520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp3698394 
End bp3699974 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content53% 
IMG OID642748804 
Productdipeptide ABC transporter periplasmic dipeptide-binding protein 
Protein accessionYP_002116568 
Protein GI194735610 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.808847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAGC TTGGTTTGAG CCTGGTGGCC ATGACCGTTG CAGCCAGCGT GCAGGCCAAA 
ACCCTGGTTT ATTGTTCAGA AGGCTCGCCG GAAGGCTTTA ACCCACAGCT CTTTACGTCT
GGCACCACCT ATGATGCCAG CTCCGTACCT ATCTATAACC GTCTGGTTGA ATTCAAAACC
GGCACCACGG AAGTGATCCC GGGTCTTGCT GAGAAGTGGG ATATCAGCGA AGACGGTAAA
ACCTATACGT TCCACCTACG TAAAGGGGTG AAATGGCAAT CCAGCAAGGA TTTCAAACCC
ACGCGCGAGC TGAACGCCGA TGATGTCGTG TTCTCTTTTG ACCGGCAGAA AAACGAGCAG
AACCCGTACC ATAAAGTGTC TGGCGGCAGC TATGAATACT TTGAAGGCAT GGGGCTGCCG
GATCTGATTA GCGAAGTGAA GAAGGTAGAC GATCACACGG TGCAGTTTGT GCTGACGCGT
CCGGAAGCGC CGTTCCTTGC CGATTTAGCC ATGGACTTCG CCTCTATTCT TTCTAAAGAA
TATGCTGACA ACATGCTGAA AGCCGGTACG CCGGAAAAAG TGGATCTGAA CCCGGTCGGC
ACTGGCCCGT TCCAACTGGT GCAATATCAG AAAGATTCCC GCATTCTCTA CAAAGCTTTT
GACGGCTACT GGGGCACGAA GCCGCAGATC GACCGTCTGG TCTTCTCCAT CACCCCTGAC
GCCTCTGTGC GTTACGCCAA ACTGCAGAAG AACGAATGTC AGGTGATGCC GTATCCGAAC
CCGGCGGATA TTGCGCGCAT GAAAGAAGAT AAAAACATCA ACCTGATGGA GCAGGCCGGT
CTGAACGTGG GTTATCTCTC CTACAACGTA CAGAAAAAAC CGCTGGATAA CGTCAAAGTT
CGCCAGGCGT TGACCTATGC CGTGAATAAA GAGGCCATCA TCAAAGCCGT TTATCAGGGC
GCGGGCGTTG CGGCGAAAAA CCTGATCCCG CCGACAATGT GGGGCTACAA CGACGATATT
AAAGACTACG GCTACGATCC GGAAAAAGCG AAGGCGCTGC TGAAAGAAGC CGGTCTGGAA
AAAGGCTTCA CCATCGATCT GTGGGCGATG CCGGTACAGC GTCCCTATAA CCCGAATGCG
CGTCGTATGG CGGAAATGAT CCAGGCGGAC TGGGCGAAGA TTGGCGTTCA GGCCAAAATT
GTCACCTATG AATGGGGCGA ATACCTCAAG CGCGCTAAAG ATGGCGAGCA CCAGACGGTG
ATGATGGGCT GGACCGGCGA TAATGGCGAT CCGGATAACT TCTTCGCCAC GCTGTTCAGC
TGCGATGCCG CCCAGCAAGG CTCCAACTAT TCAAAATGGT GCTACAAGCC GTTTGAAGAC
CTGATTCAGC CTGCGCGTGC GACCGATGAC CACAACAAGC GTATTGAACT CTATAAACAG
GCCCAGGTCG TGATGCATGA CCAGGCGCCA GCGCTGATCA TCGCTCACTC CACGGTTTAT
GAGCCAGTGC GTAAAGAAGT TAAAGGCTAT GTGGTTGATC CATTAGGCAA ACATCACTTC
GAAAACGTCT CTGTCGAATA A
 
Protein sequence
MLKLGLSLVA MTVAASVQAK TLVYCSEGSP EGFNPQLFTS GTTYDASSVP IYNRLVEFKT 
GTTEVIPGLA EKWDISEDGK TYTFHLRKGV KWQSSKDFKP TRELNADDVV FSFDRQKNEQ
NPYHKVSGGS YEYFEGMGLP DLISEVKKVD DHTVQFVLTR PEAPFLADLA MDFASILSKE
YADNMLKAGT PEKVDLNPVG TGPFQLVQYQ KDSRILYKAF DGYWGTKPQI DRLVFSITPD
ASVRYAKLQK NECQVMPYPN PADIARMKED KNINLMEQAG LNVGYLSYNV QKKPLDNVKV
RQALTYAVNK EAIIKAVYQG AGVAAKNLIP PTMWGYNDDI KDYGYDPEKA KALLKEAGLE
KGFTIDLWAM PVQRPYNPNA RRMAEMIQAD WAKIGVQAKI VTYEWGEYLK RAKDGEHQTV
MMGWTGDNGD PDNFFATLFS CDAAQQGSNY SKWCYKPFED LIQPARATDD HNKRIELYKQ
AQVVMHDQAP ALIIAHSTVY EPVRKEVKGY VVDPLGKHHF ENVSVE