Gene SNSL254_A0355 details


Gene Information

Locus tag: SNSL254_A0355
Symbol: pepD
ID: 6483683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 365433
End bp: 366890
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 53%
IMG OID: 642735783
Product: aminoacyl-histidine dipeptidase
Protein accession: YP_002039558
Protein GI: 194444329
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01893] aminoacyl-histidine dipeptidase
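
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical and is not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codon_count = gene_length_bp // 3
    # The last codon is the stop codon, so it contributes no amino acid.
    protein_length_aa = codon_count - 1
    return gene_length_bp, protein_length_aa


# Coordinates copied from the record above.
print(cds_lengths(365433, 366890))
```

Under these assumptions the result agrees with the listed Gene Length (1458 bp) and Protein Length (485 aa).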


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTGAAC TGTCTCAGTT ATCACCACAA CCTTTGTGGG ATATTTTTGC CAAAATCTGT 
TCTATTCCTC ACCCGTCCTA TCACGAAGAG CAACTCGCGG AACATATTGT GAGTTGGGCC
AAAGAGAAAG GCCTTTACGT GGATCGCGAC CAGGTCGGTA ATATTCTGAT CCGTAAGCCC
GCCACTGCAG GTATGGAAAA TCGCAAGCCG GTCGTTTTAC AGGCGCATCT GGATATGGTG
CCGCAAAAAA ATAGCGACAC CGTTCACGAC TTCACTACAG ATCCTATCCA GCCTTATATT
GATGGCGAAT GGGTTAAAGC GCGCGGTACT ACGCTCGGCG CCGACAATGG TATTGGAATG
GCTTCTGCGT TAGCGGTACT GGCTGACGAT AATGTCGTCC ACGGCCCACT GGAAGTGCTG
CTGACCATGA CCGAAGAAGC GGGCATGGAT GGCGCGTTCG GCCTCCAGTC CGGCTGGTTG
CAGGCCGACA TCCTGATCAA CACCGACTCT GAAGAAGAAG GTGAGATCTA TATGGGCTGC
GCAGGCGGCA TCGATTTTAC CTCTAATCTG CCGTTGACCC GTGAAGCTGT ACCCGCAGGA
TTTGCATGCT TTAAGCTAAC CCTGAAAGGC CTGAAAGGCG GCCACTCCGG TGGTGAAATT
CACCTCGGCC TTGGCAATGC TAACAAATTG CTGGCGCGTT TTCTGGCCGG GCACGCAGAA
GAACTGGATC TGCGTCTGAT CGATTTCAAC GGCGGCACGC TGCGTAACGC GATTCCGCGC
GAAGCGTTCG CTACCCTCGC CGTTGCCGCA GACAACGTAG GCGCGCTGAA AACATTAGTG
AACGCTTACC AGGATATTCT GAAAAACGAA CTGGCGGAAA AAGAGAAAAA CCTGACGCTG
CAACTCAATG AGGTTGCCAG TGATAAAGCC GCATTGACCG CGCCGTCACG CGATACCTTT
GTGCGCCTGC TGAACGCAAC GCCGAACGGC GTGATCCGCA ATTCAGACGT GGCGAAAGGC
GTAGTGGAAA CATCGCTGAA CGTTGGCGTG GTCACAATGT CCGATGCGAA TGTCGAAATT
CACTGCCTGA TTCGCTCTCT AATCGACAGC GGTAAAGATT ATGTGGTGAG TATGCTGGAT
TCGCTGGGCA AGCTGGCTGG CGCGAAAACC GAAGCAAAAG GCAGCTATCC TGGCTGGCAG
CCCGATGCGA ACTCGCCGGT CATGCACCTG GTGCGGGAAA CCTATCAGCG TCTGTTCAAC
AAAACACCGA ACATCCAGAT TATCCACGCC GGCCTGGAAT GCGGTCTGTT TAAGAAACCC
TATCCGGATA TGGACATGGT TTCTATTGGG CCTACCATTA CCGGACCTCA CTCTCCGGAT
GAGCAGGTAC ATATCGAAAG CGTCGGCCAC TACTGGACTC TGCTGACCGA ATTGCTGAAA
GCGATTCCTG CGAAGTAA
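
The GC content listed in the Gene Information section can be recomputed from a sequence in this layout. The sketch below assumes the sequence is pasted as text with the 10-base groups separated by spaces and line breaks, as shown here; gc_percent is a hypothetical helper name.

```python
def gc_percent(sequence_text: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(sequence_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base groups of the gene sequence above, used as a short demo.
print(gc_percent("GTGTCTGAAC TGTCTCAGTT"))
```

Passing the full gene sequence instead of the two demo groups gives the value to compare against the rounded figure in the record.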
 
Protein sequence
MSELSQLSPQ PLWDIFAKIC SIPHPSYHEE QLAEHIVSWA KEKGLYVDRD QVGNILIRKP 
ATAGMENRKP VVLQAHLDMV PQKNSDTVHD FTTDPIQPYI DGEWVKARGT TLGADNGIGM
ASALAVLADD NVVHGPLEVL LTMTEEAGMD GAFGLQSGWL QADILINTDS EEEGEIYMGC
AGGIDFTSNL PLTREAVPAG FACFKLTLKG LKGGHSGGEI HLGLGNANKL LARFLAGHAE
ELDLRLIDFN GGTLRNAIPR EAFATLAVAA DNVGALKTLV NAYQDILKNE LAEKEKNLTL
QLNEVASDKA ALTAPSRDTF VRLLNATPNG VIRNSDVAKG VVETSLNVGV VTMSDANVEI
HCLIRSLIDS GKDYVVSMLD SLGKLAGAKT EAKGSYPGWQ PDANSPVMHL VRETYQRLFN
KTPNIQIIHA GLECGLFKKP YPDMDMVSIG PTITGPHSPD EQVHIESVGH YWTLLTELLK
AIPAK
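
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below illustrates this on the first line of the gene sequence. It assumes the usual convention that an accepted initiator codon at the first position (here GTG) is read as methionine; translate_cds and the constant names are hypothetical and written for illustration only.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiator codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna_text: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    usable = len(seq) - len(seq) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "GTGTCTGAAC TGTCTCAGTT ATCACCACAA CCTTTGTGGG ATATTTTTGC CAAAATCTGT"
print(translate_cds(first_line))  # MSELSQLSPQPLWDIFAKIC
```

The output matches the first 20 residues of the protein sequence above. The same function can be applied to the full gene sequence for a complete comparison.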