Gene EcHS_A0264 details

Gene Information

Locus tag: EcHS_A0264
Symbol: pepD
ID: 5592186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 283905
End bp: 285362
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 51%
IMG OID: 640919450
Product: aminoacyl-histidine dipeptidase
Protein accession: YP_001457037
Protein GI: 157159719
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01893] aminoacyl-histidine dipeptidase
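
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. Neither convention is stated in the record, but both agree with the listed values. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 283905
end_bp = 285362

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the script reproduces the listed gene length of 1458 bp and protein length of 485 aa.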


Plasmid Coverage information

Num covering plasmid clones: 60
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCTGAAC TGTCTCAATT ATCTCCACAG CCGCTGTGGG ATATTTTTGC CAAAATCTGT 
TCTATTCCTC ACCCGTCCTA TCATGAAGAG CAACTCGCTG AATACATTGT TGGTTGGGCA
AAAGAGAAAG GTTTCCATGT CGAACGCGAT CAGGTAGGTA ATATCCTGAT TCGTAAACCT
GCTACCGCAG GTATGGAAAA TCGTAAACCG GTCGTCTTAC AGGCCCACCT CGATATGGTG
CCGCAGAAAA ATAACGACAC CGTGCATGAC TTCACGAAAG ATCCTATCCA GCCTTATATT
GATGGCGAAT GGGTTAAAGC GCGCGGCACC ACGCTGGGTG CGGATAACGG CATTGGTATG
GCCTCTGCAC TGGCAGTTCT GGCTGACGAA AACGTGGTTC ACGGCCCGCT GGAAGTGCTG
CTGACCATGA CCGAAGAAGC CGGTATGGAC GGTGCGTTCG GCTTACAGAG CAACTGGTTG
CAGGCTGATA TTCTGATTAA CACCGACTCC GAAGAAGAAG GTGAAATCTA CATGGGTTGT
GCGGGGGGTA TCGACTTCAC CTCCAACCTG CATTTAGATC GTGAAGCGGT TCCAGCTGGT
TTTGAAACCT TCAAGTTAAC CTTAAAAGGT CTGAAAGGGG GTCACTCCGG CGGGGAAATC
CACGTTGGCC TGGGTAATGC CAACAAACTG CTGGTGCGCT TCCTGGCGGG TCATGCAGAA
GAACTGGATC TGCGCCTTAT CGATTTCAAC GGCGGCACAC TGCGTAACGC CATCCCGCGT
GAAGCCTTTG CGACCATTGC TGTCGCAGCT GATAAAGTCG ACGTCCTGAA ATCTCTGGTG
AATACCTATC AGGAGATCCT GAAAAACGAG CTGGCAGAGA AAGAGAAAAA TCTGGCCTTG
TTGCTGGACT CTGTAGCGAA CGATAAAGCT GCCCTGATTG CGAAATCTCG CGATACCTTT
ATTCGTCTGC TGAACGCCAC CCCGAACGGT GTGATCCGCA ACTCAGACGT GGCAAAAGGT
GTGGTCGAAA CCTCCCTGAA CGTCGGTGTG GTGACCATGA CTGATAATAA CGTAGAAATT
CACTGTCTGA TCCGTTCACT GATCGACAGC GGTAAAGACT ACGTGGTGAG CATGCTGGAT
TCGCTGGGTA AACTGGCTGG CGCGAAAACC GAAGCGAAAG GCGCATATCC TGGCTGGCAG
CCGGACGCTA ATTCTCCGGT GATGCATCTG GTACGTGAAA CCTATCAGCG TCTGTTCAAC
AAAACGCCGA ACATCCAGAT TATCCACGCG GGCCTGGAAT GTGGTCTGTT CAAAAAACCG
TATCCGGAAA TGGACATGGT TTCTATCGGG CCAACTATCA CCGGTCCACA CTCTCCGGAT
GAGCAAGTTC ACATCGAAAG CGTAGGTCAT TACTGGACAC TGCTGACTGA ACTGCTGAAA
GAAATTCCGG CGAAGTAA
 
Protein sequence
MSELSQLSPQ PLWDIFAKIC SIPHPSYHEE QLAEYIVGWA KEKGFHVERD QVGNILIRKP 
ATAGMENRKP VVLQAHLDMV PQKNNDTVHD FTKDPIQPYI DGEWVKARGT TLGADNGIGM
ASALAVLADE NVVHGPLEVL LTMTEEAGMD GAFGLQSNWL QADILINTDS EEEGEIYMGC
AGGIDFTSNL HLDREAVPAG FETFKLTLKG LKGGHSGGEI HVGLGNANKL LVRFLAGHAE
ELDLRLIDFN GGTLRNAIPR EAFATIAVAA DKVDVLKSLV NTYQEILKNE LAEKEKNLAL
LLDSVANDKA ALIAKSRDTF IRLLNATPNG VIRNSDVAKG VVETSLNVGV VTMTDNNVEI
HCLIRSLIDS GKDYVVSMLD SLGKLAGAKT EAKGAYPGWQ PDANSPVMHL VRETYQRLFN
KTPNIQIIHA GLECGLFKKP YPEMDMVSIG PTITGPHSPD EQVHIESVGH YWTLLTELLK
EIPAK
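
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is an accepted start codon and the initiating residue is methionine. The sketch below illustrates this on the first 60 bases of the gene sequence. It is a minimal example, not the database's own pipeline: the helper name `translate_cds` is hypothetical, the codon table is built by hand rather than taken from a library, and the function assumes its input begins at the start codon of a CDS.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code; it differs in the allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS (or its 5' fragment), reading the start codon as M."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    residues = ["M"] + [CODON_TABLE[c] for c in codons[1:]]
    return "".join(residues).rstrip("*")  # drop a terminal stop, if present


# First line of the gene sequence above (60 bases, 20 codons).
first_60_bases = (
    "GTGTCTGAAC TGTCTCAATT ATCTCCACAG CCGCTGTGGG ATATTTTTGC CAAAATCTGT"
)
print(translate_cds(first_60_bases))
```

The result can be compared against the first 20 residues of the protein sequence above (MSELSQLSPQ PLWDIFAKIC).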