Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1628 |
Symbol | dcp |
ID | 5591257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1648403 |
End bp | 1650448 |
Gene Length | 2046 bp |
Protein Length | 681 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640920776 |
Product | dipeptidyl carboxypeptidase II |
Protein accession | YP_001458332 |
Protein GI | 157161014 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000000000356686 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACAA TGAATCCTTT CCTTGTGCAA AGCACACTGC CGTATCTGGC TCCTCATTTT GATCAAATTG CCAATCATCA CTATCGCCCG GCATTCGATG AGGGAATGCA GCAAAAGCGG GCAGAAATTG CTGCCATCGC GCTTAACCCG CAAACGCCTG ATTTCAACAA TACTATTCTG GCACTGGAAC AAAGCGGAGA ATTACTTACC CGCGTTACCA GCGTCTTTTT TGCGATGACT GCGGCGCATA CCAATGATGA ATTACAGCGT CTTGATGAAC AGTTTTCCGC TGAACTGGCG GAACTGGCTA ATGATATCTA TCTGAACGGT GAATTATTCG CGCGGGTAGA TGCTGTCTGG CAGCGCCGTG AATCCCTGGG GCTTGATAGT GAATCCATCC GCCTGGTGGA GGTGATTCAT CAACGTTTTG TCCTTGCCGG AGCCAAACTT GCGCAAGCTG ATAAAGCAAA ATTAAAAGTA CTGAATACAG AAGCTGCGAC ACTGACCAGT CAGTTTAACC AGCGATTACT GGCAGCAAAT AAATCCGGCG GTCTGGTTGT GAACGATATC GCGCAGCTGG CAGGAATGAG TGAGCAAGAG ATTGCGCTGG CGGCAGAGGC GGCTCGCGAG AAAGGTCTGG ATAACAAATG GCTGATTCCG CTGCTGAATA CCACCCAACA ACCGGCGCTT GCCGAGATGC GCGATCGTGC GACGCGTGAA AAACTGTTTA TTGCGGGCTG GACGCGAGCG GAAAAAAATG ATGCCAATGA TACCCGCGCT ATCATTCAAC GTCTGGTAGA GATTCGCGCA CAGCAGGCGA AACTGCTTGG TTTTCCTCAT TATGCCGCAT GGAAAATCGC CGATCAGATG GCAAAAACAC CTGAAGCAGC ACTTAACTTT ATGCGGGAAA TTGTTCCAGC GGCGCGTCAA CGTGCGAGCG ATGAATTAGC CTCCATACAG GCGGTTATTG ATAAGCAGCA GGGCGGGTTT AGCGCGCAGC CGTGGGACTG GGCATTTTAT GCCGAACAGG TACGGCGGGA GAAATTTGAT CTTGATGAGG CGCAGCTCAA GCCATATTTT GAATTAAACA CGGTGTTGAA TGAAGGTGTA TTCTGGACCG CGAATCAGCT CTTCGGTATT AAGTTTGTCG AACGTTTTGA TATTCCTGTC TACCATCCTG ACGTTCGTGT GTGGGAAATT TTTGATCATA ATGGCGTGGG ACTGGCGTTA TTTTACGGTG ATTTCTTCGC CCGTGATTCA AAAAGCGGCG GTGCATGGAT GGGCAATTTT GTTGAGCAAT CAACGCTTAA TGAAACGCAT CCGGTAATTT ATAACGTCTG CAATTATCAG AAACCCGCTG CCGGTGAGCC TGCGTTGTTA CTCTGGGATG ATGTAATAAC CTTATTCCAT GAATTTGGTC ATACGCTGCA CGGCCTTTTT GCCCGCCAGC GTTATGCCAC GCTTTCCGGC ACCAACACGC CGCGTGATTT TGTCGAATTT CCGTCGCAAA TCAACGAACA CTGGGCAACG CATCCGCAGG TATTCGCTCG CTACGCCCGG CATTATCAGA GCGGGGCAGC AATGCCTGAC GAACTGCAAC AGAAAATGCG TAATGCCAGC CTGTTCAACA AAGGGTATGA GATGAGCGAA CTGCTTAGCG CCGCACTTCT CGATATGCGC TGGCATTGCC TGGAAGAAAA CGAAGCAATG CAGGATGTCG ATGATTTTGA ATTGCGGGCG CTGGTGGCGG AAAATATGGA TCTTCCTGCT ATACCGCCAC GCTATCGCAG CAGTTATTTC GCCCATATTT TTGGTGGCGG ATATGCTGCA GGTTATTACG CTTATCTGTG GACGCAAATG TTGGCCGATG ATGGTTACCA GTGGTTTGTT GAGCAGGGCG GATTAACGCG TGAAAATGGG CAGCGTTTTC GCGAGGCGAT CCTTTCCAGA GGTAACAGCG AAGATCTGGA ACGCCTGTAT CGACAATGGC GCGGTAAGGC TCCTCAGATT ATGCCGATGC TGCAACATCG TGGCTTGAAC ATATAA
|
Protein sequence | MTTMNPFLVQ STLPYLAPHF DQIANHHYRP AFDEGMQQKR AEIAAIALNP QTPDFNNTIL ALEQSGELLT RVTSVFFAMT AAHTNDELQR LDEQFSAELA ELANDIYLNG ELFARVDAVW QRRESLGLDS ESIRLVEVIH QRFVLAGAKL AQADKAKLKV LNTEAATLTS QFNQRLLAAN KSGGLVVNDI AQLAGMSEQE IALAAEAARE KGLDNKWLIP LLNTTQQPAL AEMRDRATRE KLFIAGWTRA EKNDANDTRA IIQRLVEIRA QQAKLLGFPH YAAWKIADQM AKTPEAALNF MREIVPAARQ RASDELASIQ AVIDKQQGGF SAQPWDWAFY AEQVRREKFD LDEAQLKPYF ELNTVLNEGV FWTANQLFGI KFVERFDIPV YHPDVRVWEI FDHNGVGLAL FYGDFFARDS KSGGAWMGNF VEQSTLNETH PVIYNVCNYQ KPAAGEPALL LWDDVITLFH EFGHTLHGLF ARQRYATLSG TNTPRDFVEF PSQINEHWAT HPQVFARYAR HYQSGAAMPD ELQQKMRNAS LFNKGYEMSE LLSAALLDMR WHCLEENEAM QDVDDFELRA LVAENMDLPA IPPRYRSSYF AHIFGGGYAA GYYAYLWTQM LADDGYQWFV EQGGLTRENG QRFREAILSR GNSEDLERLY RQWRGKAPQI MPMLQHRGLN I
|
| |