Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_2107 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 2251098 |
End bp | 2253143 |
Gene Length | 2046 bp |
Protein Length | 681 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | Peptidyl-dipeptidase Dcp |
Protein accession | ACX39762 |
Protein GI | 260449340 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000000100258 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACAA TGAATCCTTT CCTTGTGCAA AGCACACTGC CGTATCTGGC TCCCCATTTT GATCAAATTG CCAATCATCA CTATCGCCCG GCATTCGATG AGGGAATGCA GCAAAAGCGG GCAGAAATTG CTGCCATCGC GCTTAACCCG CAAATGCCTG ATTTCAACAA TACTATTCTG GCACTGGAAC AAAGCGGAGA ATTACTTACC CGCGTTACCA GCGTCTTTTT TGCGATGACT GCGGCGCATA CCAATGATGA ATTACAGCGT CTTGACGAGC AGTTTTCCGC TGAACTGGCG GAACTGGCTA ATGATATCTA TCTGAACGGT GAATTATTCG CGCGGGTAGA TGCTGTCTGG CAGCGCCGTG AATCCCTGGG GCTTGATAGT GAATCCATCC GCCTGGTGGA GGTGATTCAT CAACGTTTTG TCCTTGCCGG AGCCAAACTT GCGCAAGCTG ATAAAGCAAA ATTAAAAGTA CTGAATACAG AAGCTGCGAC CCTGACCAGC CAGTTTAACC AGCGATTACT GGCAGCAAAT AAATCCGGCG GTCTGGTTGT GAACGATATC GCGCAGCTGG CAGGAATGAG TGAGCAAGAG ATTGCGCTGG CGGCAGAGGC GGCTCGCGAG AAAGGTCTGG ATAACAAATG GCTGATTCCG CTGCTGAATA CCACCCAACA ACCGGCGCTT GCCGAAATGC GCGATCGTGC GACGCGTGAA AAACTGTTTA TTGCGGGCTG GACGCGAGCG GAAAAAAATG ATGCCAATGA TACCCGCGCT ATCATTCAAC GTCTGGTGGA GATCCGTGCA CAACAGGCAA CACTACTTGG TTTTCCTCAT TATGCCGCAT GGAAAATCGC CGATCAGATG GCAAAAACAC CTGAAGCAGC ACTTAACTTT ATGCGGGAAA TTGTTCCAGC GGCGCGTCAA CGTGCGAGCG ATGAATTAGC CTCCATACAG GCGGTTATCG ATAAGCAGCA GGGCGGGTTT AGCGCGCAGC CGTGGGACTG GGCATTTTAT GCGGAACAGG TACGGCGGGA GAAATTTGAT CTTGATGAGG CGCAGCTCAA GCCATATTTT GAATTAAACA CGGTGTTAAA TGAAGGTGTA TTCTGGACCG CGAATCAGCT CTTCGGTATT AAGTTTGTCG AACGTTTTGA TATTCCTGTC TACCATCCTG ACGTTCGTGT GTGGGAAATT TTTGATCATA ATGGCGTGGG GCTGGCGTTA TTTTACGGTG ATTTCTTCGC CCGTGATTCA AAAAGCGGCG GTGCATGGAT GGGCAATTTT GTTGAGCAAT CAACGCTTAA TAAAACACAT CCGGTAATTT ATAACGTCTG CAATTATCAG AAACCCGCTG CCGGTGAGCC TGCGTTGTTA CTCTGGGATG ATGTCATAAC CTTATTCCAT GAATTTGGTC ATACGCTGCA CGGCCTTTTT GCCCGCCAGC GTTATGCCAC GCTTTCCGGC ACCAACACGC CGCGTGATTT TGTCGAATTT CCGTCGCAAA TCAACGAACA CTGGGCAACG CATCCGCAGG TATTCGCTCG CTACGCCCGG CATTATCAGA GCGGGGCAGC AATGCCTGAC GAACTGCAAC AGAAAATGCG TAATGCCAGC CTGTTCAACA AAGGGTATGA GATGAGCGAA CTGCTTAGCG CCGCACTTCT CGATATGCGC TGGCATTGCC TGGAAGAAAA CGAAGCAATG CAGGATGTCG ATGATTTTGA ATTGCGGGCG CTGGTGGCGG AAAATATGGA TCTTCCTGCT ATACCGCCAC GCTATCGCAG CAGTTATTTC GCCCATATTT TTGGTGGCGG ATATGCTGCA GGTTATTACG CTTATCTGTG GACGCAAATG TTGGCCGATG ATGGTTATCA GTGGTTTGTT GAGCAGGGCG GATTAACGCG TGAAAATGGG CTGCGTTTTC GCGAGGCGAT CCTTTCCAGA GGTAACAGCG AAGATCTGGA ACGCCTGTAT CGACAATGGC GCGGTAAGGC ACCTAAGATT ATGCCGATGC TGCAACATCG TGGCTTGAAC ATATAA
|
Protein sequence | MTTMNPFLVQ STLPYLAPHF DQIANHHYRP AFDEGMQQKR AEIAAIALNP QMPDFNNTIL ALEQSGELLT RVTSVFFAMT AAHTNDELQR LDEQFSAELA ELANDIYLNG ELFARVDAVW QRRESLGLDS ESIRLVEVIH QRFVLAGAKL AQADKAKLKV LNTEAATLTS QFNQRLLAAN KSGGLVVNDI AQLAGMSEQE IALAAEAARE KGLDNKWLIP LLNTTQQPAL AEMRDRATRE KLFIAGWTRA EKNDANDTRA IIQRLVEIRA QQATLLGFPH YAAWKIADQM AKTPEAALNF MREIVPAARQ RASDELASIQ AVIDKQQGGF SAQPWDWAFY AEQVRREKFD LDEAQLKPYF ELNTVLNEGV FWTANQLFGI KFVERFDIPV YHPDVRVWEI FDHNGVGLAL FYGDFFARDS KSGGAWMGNF VEQSTLNKTH PVIYNVCNYQ KPAAGEPALL LWDDVITLFH EFGHTLHGLF ARQRYATLSG TNTPRDFVEF PSQINEHWAT HPQVFARYAR HYQSGAAMPD ELQQKMRNAS LFNKGYEMSE LLSAALLDMR WHCLEENEAM QDVDDFELRA LVAENMDLPA IPPRYRSSYF AHIFGGGYAA GYYAYLWTQM LADDGYQWFV EQGGLTRENG LRFREAILSR GNSEDLERLY RQWRGKAPKI MPMLQHRGLN I
|
| |