Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0264 |
Symbol | pepD |
ID | 5592186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 283905 |
End bp | 285362 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640919450 |
Product | aminoacyl-histidine dipeptidase |
Protein accession | YP_001457037 |
Protein GI | 157159719 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01893] aminoacyl-histidine dipeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 60 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTGAAC TGTCTCAATT ATCTCCACAG CCGCTGTGGG ATATTTTTGC CAAAATCTGT TCTATTCCTC ACCCGTCCTA TCATGAAGAG CAACTCGCTG AATACATTGT TGGTTGGGCA AAAGAGAAAG GTTTCCATGT CGAACGCGAT CAGGTAGGTA ATATCCTGAT TCGTAAACCT GCTACCGCAG GTATGGAAAA TCGTAAACCG GTCGTCTTAC AGGCCCACCT CGATATGGTG CCGCAGAAAA ATAACGACAC CGTGCATGAC TTCACGAAAG ATCCTATCCA GCCTTATATT GATGGCGAAT GGGTTAAAGC GCGCGGCACC ACGCTGGGTG CGGATAACGG CATTGGTATG GCCTCTGCAC TGGCAGTTCT GGCTGACGAA AACGTGGTTC ACGGCCCGCT GGAAGTGCTG CTGACCATGA CCGAAGAAGC CGGTATGGAC GGTGCGTTCG GCTTACAGAG CAACTGGTTG CAGGCTGATA TTCTGATTAA CACCGACTCC GAAGAAGAAG GTGAAATCTA CATGGGTTGT GCGGGGGGTA TCGACTTCAC CTCCAACCTG CATTTAGATC GTGAAGCGGT TCCAGCTGGT TTTGAAACCT TCAAGTTAAC CTTAAAAGGT CTGAAAGGGG GTCACTCCGG CGGGGAAATC CACGTTGGCC TGGGTAATGC CAACAAACTG CTGGTGCGCT TCCTGGCGGG TCATGCAGAA GAACTGGATC TGCGCCTTAT CGATTTCAAC GGCGGCACAC TGCGTAACGC CATCCCGCGT GAAGCCTTTG CGACCATTGC TGTCGCAGCT GATAAAGTCG ACGTCCTGAA ATCTCTGGTG AATACCTATC AGGAGATCCT GAAAAACGAG CTGGCAGAGA AAGAGAAAAA TCTGGCCTTG TTGCTGGACT CTGTAGCGAA CGATAAAGCT GCCCTGATTG CGAAATCTCG CGATACCTTT ATTCGTCTGC TGAACGCCAC CCCGAACGGT GTGATCCGCA ACTCAGACGT GGCAAAAGGT GTGGTCGAAA CCTCCCTGAA CGTCGGTGTG GTGACCATGA CTGATAATAA CGTAGAAATT CACTGTCTGA TCCGTTCACT GATCGACAGC GGTAAAGACT ACGTGGTGAG CATGCTGGAT TCGCTGGGTA AACTGGCTGG CGCGAAAACC GAAGCGAAAG GCGCATATCC TGGCTGGCAG CCGGACGCTA ATTCTCCGGT GATGCATCTG GTACGTGAAA CCTATCAGCG TCTGTTCAAC AAAACGCCGA ACATCCAGAT TATCCACGCG GGCCTGGAAT GTGGTCTGTT CAAAAAACCG TATCCGGAAA TGGACATGGT TTCTATCGGG CCAACTATCA CCGGTCCACA CTCTCCGGAT GAGCAAGTTC ACATCGAAAG CGTAGGTCAT TACTGGACAC TGCTGACTGA ACTGCTGAAA GAAATTCCGG CGAAGTAA
|
Protein sequence | MSELSQLSPQ PLWDIFAKIC SIPHPSYHEE QLAEYIVGWA KEKGFHVERD QVGNILIRKP ATAGMENRKP VVLQAHLDMV PQKNNDTVHD FTKDPIQPYI DGEWVKARGT TLGADNGIGM ASALAVLADE NVVHGPLEVL LTMTEEAGMD GAFGLQSNWL QADILINTDS EEEGEIYMGC AGGIDFTSNL HLDREAVPAG FETFKLTLKG LKGGHSGGEI HVGLGNANKL LVRFLAGHAE ELDLRLIDFN GGTLRNAIPR EAFATIAVAA DKVDVLKSLV NTYQEILKNE LAEKEKNLAL LLDSVANDKA ALIAKSRDTF IRLLNATPNG VIRNSDVAKG VVETSLNVGV VTMTDNNVEI HCLIRSLIDS GKDYVVSMLD SLGKLAGAKT EAKGAYPGWQ PDANSPVMHL VRETYQRLFN KTPNIQIIHA GLECGLFKKP YPEMDMVSIG PTITGPHSPD EQVHIESVGH YWTLLTELLK EIPAK
|
| |