Gene SeD_A0346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0346 
SymbolpepD 
ID6871075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp364543 
End bp366000 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content53% 
IMG OID642783583 
Productaminoacyl-histidine dipeptidase 
Protein accessionYP_002214271 
Protein GI198245847 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0296229 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTGAAC TGTCTCAGTT ATCACCACAA CCTTTGTGGG ATATTTTTGC CAAAATCTGT 
TCTATTCCTC ACCCGTCCTA TCACGAAGAG CAACTCGCGG AACATATTGT GAGTTGGGCC
AAAGAGAAAG GCCTTTACGT GGATCGCGAC CAGGTCGGTA ATATTCTGAT CCGTAAGCCC
GCCACTGCAG GTATGGAAAA TCGCAAGCCG GTCGTTTTAC AGGCGCATCT GGATATGGTG
CCGCAAAAAA ATAGCGACAC CGTTCACGAC TTCACTACAG ATCCTATCCA GCCTTATATT
GATGGCGAAT GGGTTAAAGC GCGCGGTACT ACGCTCGGCG CCGACAATGG TATTGGAATG
GCTTCTGCGT TAGCGGTACT GGCTGATGAT AATGTCGTCC ATGGCCCGCT GGAAGTGCTG
CTGACCATGA CCGAAGAAGC GGGCATGGAT GGCGCGTTCG GCCTCCAGTC CGGCTGGTTG
CAGGCCGACA TCCTGATCAA CACCGACTCT GAAGAAGAAG GTGAGATCTA TATGGGCTGC
GCAGGCGGTA TTGATTTTAC CTCTAATCTG CCGTTGACCC GTGAAGCTGT ACCCGCAGGA
TTTGCGTGCT TTAAGCTAAC CCTGAAAGGC CTGAAAGGCG GCCACTCCGG TGGTGAAATT
CACCTCGGTC TTGGCAATGC CAACAAATTG CTGGCGCGTT TTCTGGCCGG GCACGCAGAA
GAACTGGATC TGCGTCTGAT CGATTTCAAC GGCGGCACGC TGCGTAACGC GATTCCGCGC
GAAGCGTTCG CTACCCTCGC CGTTGCCGCA GACAACGTAG GCGCTCTGAA AACATTAGTG
AACGCTTACC AGGATATTCT GAAAAACGAA CTGGCGGAAA AAGAGAAAAA CCTGACGCTG
CAACTCAATG AGGTTGCCAG TGATAAAGCC GCATTGACCG CGCCGTCACG CGATACCTTT
GTGCGCCTGC TGAACGCAAC GCCGAACGGC GTGATCCGCA ATTCAGACGT GGCGAAAGGC
GTAGTGGAAA CATCGCTGAA CGTTGGCGTG GTCACAATGT CTGATGCGAA TGTCGAAATT
CACTGCCTGA TTCGCTCTCT TATCGACAGC GGTAAAGATT ATGTGGTGAG TATGCTGGAT
TCGCTGGGCA AGCTGGCTGG CGCGAAAACC GAAGCAAAAG GCAGCTATCC TGGCTGGCAG
CCCGATGCGA ACTCGCCGGT CATGCACCTG GTGCGGGAAA CCTATCAGCG TCTGTTTAAC
AAGACACCTA ACATCCAGAT TATCCACGCC GGCCTGGAAT GCGGTCTGTT TAAGAAACCC
TATCCGGATA TGGACATGGT TTCTATTGGG CCTACCATTA CCGGACCTCA CTCTCCGGAT
GAGCAGGTAC ATATCGAAAG CGTCGGCCAC TACTGGACTC TGCTGACCGA ATTGCTGAAA
GCGATTCCTG CGAAGTAA
 
Protein sequence
MSELSQLSPQ PLWDIFAKIC SIPHPSYHEE QLAEHIVSWA KEKGLYVDRD QVGNILIRKP 
ATAGMENRKP VVLQAHLDMV PQKNSDTVHD FTTDPIQPYI DGEWVKARGT TLGADNGIGM
ASALAVLADD NVVHGPLEVL LTMTEEAGMD GAFGLQSGWL QADILINTDS EEEGEIYMGC
AGGIDFTSNL PLTREAVPAG FACFKLTLKG LKGGHSGGEI HLGLGNANKL LARFLAGHAE
ELDLRLIDFN GGTLRNAIPR EAFATLAVAA DNVGALKTLV NAYQDILKNE LAEKEKNLTL
QLNEVASDKA ALTAPSRDTF VRLLNATPNG VIRNSDVAKG VVETSLNVGV VTMSDANVEI
HCLIRSLIDS GKDYVVSMLD SLGKLAGAKT EAKGSYPGWQ PDANSPVMHL VRETYQRLFN
KTPNIQIIHA GLECGLFKKP YPDMDMVSIG PTITGPHSPD EQVHIESVGH YWTLLTELLK
AIPAK