Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00235 |
Symbol | pepD |
ID | 8113448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 258746 |
End bp | 260203 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644846525 |
Product | hypothetical protein |
Protein accession | YP_002998098 |
Protein GI | 251783794 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01893] aminoacyl-histidine dipeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTGAAC TGTCTCAATT ATCTCCACAG CCGCTGTGGG ATATTTTTGC CAAAATCTGT TCTATTCCTC ACCCGTCCTA TCATGAAGAG CAACTCGCTG AATACATTGT TGGTTGGGCA AAAGAGAAAG GTTTCCATGT CGAACGCGAT CAGGTAGGTA ATATCCTGAT TCGTAAACCT GCCACCGCAG GTATGGAAAA TCGTAAACCG GTCGTCTTGC AGGCCCACCT CGATATGGTG CCGCAGAAAA ATAACGACAC CGTGCATGAC TTCACGAAAG ATCCTATCCA GCCTTATATT GATGGCGAAT GGGTTAAAGC GCGCGGCACC ACGCTGGGTG CAGATAACGG CATTGGTATG GCGTCTGCGC TGGCGGTTCT GGCTGACGAA AACGTGGTTC ACGGCCCGCT GGAAGTGCTG CTGACCATGA CCGAAGAAGC CGGTATGGAC GGTGCGTTCG GCTTACAGAG CAACTGGTTG CAGGCTGATA TTCTGATTAA CACCGACTCC GAAGAAGAAG GTGAAATCTA CATGGGTTGC GCGGGGGGTA TCGACTTCAC CTCCAACCTG CATTTGGATC GTGAAGCGGT TCCAGCTGGC TTTGAAACCT TCAAGTTAAC CTTAAAAGGT CTAAAAGGCG GTCACTCCGG CGGTGAAATC CACGTTGGCC TGGGTAATGC CAACAAACTG CTGGTGCGCT TCCTGGCGGG TCATGCGGAA GAGCTGGACC TGCGCCTTAT CGATTTCAAC GGTGGCACAC TGCGTAACGC CATCCCGCGT GAAGCCTTTG CGACCATTGC TGTCGCAGCT GATAAAGTCG ACGCCCTGAA ATCTCTGGTG AATACCTATC AGGACATCCT GAAAAACGAG CTGGCAGAGA AAGAGAAGAA TCTGGCCTTG TTGCTGGACT CTGTAGCGAA CGATAAAGCT GCTCTGATTG CGAAATCTCG CGATACCTTT ATTCGTCTGC TGAACGCCAC CCCGAACGGT GTGATCCGCA ATTCCGACGT GGCAAAAGGT GTGGTCGAAA CCTCCCTGAA CGTCGGTGTG GTGACCATGA CTGACAATAA CGTAGAAATT CACTGCCTGA TCCGTTCACT GATCGACAGC GGTAAAGACT ACGTGGTGAG CATGCTGGAT TCGCTGGGTA AACTGGCTGG CGCGAAAACC GAAGCGAAAG GCGCATATCC TGGCTGGCAG CCGGACGCTA ATTCTCCGGT GATGCATCTG GTACGTGAAA CCTATCAGCG CCTGTTCAAC AAGACGCCGA ACATCCAGAT TATCCACGCG GGCCTGGAAT GTGGTCTGTT TAAAAAACCG TATCCGGAAA TGGACATGGT TTCTATCGGG CCAACTATCA CCGGTCCACA CTCTCCGGAT GAGCAAGTTC ACATCAAAAG CGTAGGTCAT TACTGGACAC TGCTGACTGA ACTGCTGAAA GAAATTCCGG CGAAGTAA
|
Protein sequence | MSELSQLSPQ PLWDIFAKIC SIPHPSYHEE QLAEYIVGWA KEKGFHVERD QVGNILIRKP ATAGMENRKP VVLQAHLDMV PQKNNDTVHD FTKDPIQPYI DGEWVKARGT TLGADNGIGM ASALAVLADE NVVHGPLEVL LTMTEEAGMD GAFGLQSNWL QADILINTDS EEEGEIYMGC AGGIDFTSNL HLDREAVPAG FETFKLTLKG LKGGHSGGEI HVGLGNANKL LVRFLAGHAE ELDLRLIDFN GGTLRNAIPR EAFATIAVAA DKVDALKSLV NTYQDILKNE LAEKEKNLAL LLDSVANDKA ALIAKSRDTF IRLLNATPNG VIRNSDVAKG VVETSLNVGV VTMTDNNVEI HCLIRSLIDS GKDYVVSMLD SLGKLAGAKT EAKGAYPGWQ PDANSPVMHL VRETYQRLFN KTPNIQIIHA GLECGLFKKP YPEMDMVSIG PTITGPHSPD EQVHIKSVGH YWTLLTELLK EIPAK
|
| |