Gene SeHA_C0357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0357 
SymbolpepD 
ID6487929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp368937 
End bp370394 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content53% 
IMG OID642740632 
Productaminoacyl-histidine dipeptidase 
Protein accessionYP_002044300 
Protein GI194448410 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0000000446766 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCTGAAC TGTCTCAGTT ATCCCCACAA CCTTTGTGGG ATATTTTTGC CAAAATCTGT 
TCTATTCCTC ACCCGTCCTA TCACGAAGAG CAACTCGCGG AACATATTGT GAGTTGGGCC
AAAGAGAAAG GCCTTTACGT GGATCGCGAC CAGGTCGGTA ATATTCTGAT CCGTAAGCCC
GCCACTGCAG GTATGGAAAA TCGCAAGCCG GTCGTTTTAC AGGCGCATCT GGATATGGTG
CCGCAAAAAA ATAGCGACAC CGTTCACGAC TTCACTACAG ATCCTATCCA GCCTTATATT
GATGGCGAAT GGGTTAAAGC GCGCGGTACT ACGCTCGGCG CCGACAATGG TATTGGAATG
GCTTCTGCGT TAGCGGTACT GGCTGATGAT AATGTCGTCC ATGGCCCGCT GGAAGTGCTG
CTGACCATGA CCGAAGAAGC GGGCATGGAT GGCGCGTTCG GCCTCCAGTC CGGCTGGTTG
CAGGCCGACA TCCTGATCAA CACCGACTCT GAAGAAGAAG GTGAGATCTA TATGGGCTGC
GCAGGCGGTA TTGATTTTAC CTCTAATCTG CCGTTGACCC GTGAAGCTGT ACCCGCAGGA
TTTGCGTGCT TTAAGCTAAC CCTGAAAGGC CTGAAAGGCG GCCACTCCGG TGGTGAAATT
CACCTCGGTC TTGGCAATGC CAACAAATTG CTGGCGCGTT TTCTGGCCGG GCACGCAGAA
GAACTGGATC TGCGTCTGAT CGATTTCAAC GGCGGCACGC TGCGTAACGC GATTCCGCGC
GAAGCGTTCG CTACCCTCGC CGTTGCCGCA GACAACGTAG GCGCGCTGAA AACATTAGTG
AACGCTTACC AGGATATTCT GAAAAACGAA CTGGCGGAAA AAGAGAAAAA CCTGACGCTG
CAACTCAATG AGGTTGCCAG TGATAAAGCC GCATTGACCG CGCCGTCACG CGATACCTTT
GTGCGCCTGC TGAACGCAAC GCCGAACGGC GTGATCCGCA ATTCAGACGT GGCGAAAGGC
GTAGTGGAAA CATCGCTGAA CGTTGGCGTG GTCACAATGT CCGATGCGAA TGTCGAAATT
CACTGCCTGA TTCGCTCTCT TATCGACAGC GGTAAAGATT ATGTGGTGAG TATGCTGGAT
TCGCTGGGCA AGCTGGCTGG CGCGAAAACC GAAGCAAAAG GCAGCTATCC TGGCTGGCAG
CCCGATGCGA ACTCGCCGGT CATGCACCTG GTGCGGGAAA CCTATCAGCG TCTGTTCAAC
AAGACACCGA ACATCCAGAT TATCCACGCC GGCCTGGAAT GCGGTCTGTT TAAGAAACCC
TATCCGGATA TGGACATGGT TTCTATTGGG CCTACCATTA CCGGACCTCA CTCTCCGGAT
GAGCAGGTAC ATATCGAAAG CGTCGGCCAC TACTGGACTC TGCTGACCGA ATTGCTGAAA
GCGATTCCTG CGAAGTAA
 
Protein sequence
MSELSQLSPQ PLWDIFAKIC SIPHPSYHEE QLAEHIVSWA KEKGLYVDRD QVGNILIRKP 
ATAGMENRKP VVLQAHLDMV PQKNSDTVHD FTTDPIQPYI DGEWVKARGT TLGADNGIGM
ASALAVLADD NVVHGPLEVL LTMTEEAGMD GAFGLQSGWL QADILINTDS EEEGEIYMGC
AGGIDFTSNL PLTREAVPAG FACFKLTLKG LKGGHSGGEI HLGLGNANKL LARFLAGHAE
ELDLRLIDFN GGTLRNAIPR EAFATLAVAA DNVGALKTLV NAYQDILKNE LAEKEKNLTL
QLNEVASDKA ALTAPSRDTF VRLLNATPNG VIRNSDVAKG VVETSLNVGV VTMSDANVEI
HCLIRSLIDS GKDYVVSMLD SLGKLAGAKT EAKGSYPGWQ PDANSPVMHL VRETYQRLFN
KTPNIQIIHA GLECGLFKKP YPDMDMVSIG PTITGPHSPD EQVHIESVGH YWTLLTELLK
AIPAK