Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_0472 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 501654 |
End bp | 503021 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | protease Do |
Protein accession | ACX38160 |
Protein GI | 260447738 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.00978488 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC AAACCCAGCT GTTGAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTCTCG GCGTCATTTC AGGCCGTCGC GTCGATTCCA GGCCAGGTTG CCGATCAGGC CCCTCTCCCC AGTCTGGCTC CAATGCTGGA AAAAGTGCTT CCGGCAGTGG TGAGCGTACG GGTGGAAGGA ACGGCCAGTC AGGGACAGAA AATCCCGGAA GAATTCAAAA AGTTTTTTGG TGATGATTTA CCGGATCAAC CTGCACAACC CTTCGAAGGT TTAGGCTCCG GTGTCATCAT CAACGCCAGT AAAGGCTATG TGCTGACCAA CAACCATGTG ATTAATCAGG CACAGAAAAT CAGTATTCAG CTCAATGATG GGCGCGAGTT TGATGCAAAA CTGATTGGTA GCGATGACCA GAGCGATATC GCCCTGTTAC AAATTCAAAA CCCGAGCAAA TTAACGCAAA TCGCTATTGC CGACTCCGAT AAATTGCGCG TCGGTGATTT TGCCGTAGCG GTCGGTAACC CATTTGGCCT TGGGCAAACC GCCACCTCTG GCATTGTTTC CGCATTAGGC CGCAGCGGGT TGAATCTTGA AGGTCTGGAA AACTTTATCC AGACAGATGC TTCCATTAAC CGCGGTAACT CCGGCGGTGC ACTATTAAAC CTTAACGGTG AGTTAATTGG CATCAACACT GCAATCCTTG CGCCTGGCGG CGGGAGCGTC GGGATTGGAT TTGCCATCCC CAGTAATATG GCGCGAACAC TGGCGCAGCA GCTTATCGAC TTTGGTGAAA TCAAACGCGG TTTGTTAGGC ATCAAAGGCA CCGAGATGAG TGCCGATATC GCCAAAGCCT TCAACCTTGA CGTGCAGCGT GGCGCGTTTG TCAGCGAAGT GTTGCCAGGT TCTGGCTCGG CAAAAGCGGG CGTCAAAGCG GGCGATATTA TTACCAGCCT CAACGGCAAA CCGCTGAATA GCTTTGCTGA GTTGCGCTCT CGTATCGCGA CCACCGAGCC GGGCACGAAA GTGAAGCTTG GCCTGCTGCG TAACGGCAAA CCACTGGAAG TAGAAGTGAC GCTCGATACC AGCACCTCTT CGTCGGCCAG CGCTGAAATG ATCACGCCAG CGCTGGAAGG TGCAACGTTG AGCGATGGTC AGCTAAAAGA TGGCGGCAAA GGTATTAAAA TCGATGAAGT TGTCAAAGGA AGCCCAGCTG CTCAGGCTGG CTTGCAAAAA GACGATGTGA TCATTGGCGT CAACCGCGAT CGGGTGAACT CGATTGCTGA AATGCGTAAA GTGCTGGCGG CAAAACCGGC CATCATCGCC CTGCAAATTG TACGCGGCAA TGAAAGCATC TATCTGCTGA TGCGTTAA
|
Protein sequence | MKKQTQLLSA LALSVGLTLS ASFQAVASIP GQVADQAPLP SLAPMLEKVL PAVVSVRVEG TASQGQKIPE EFKKFFGDDL PDQPAQPFEG LGSGVIINAS KGYVLTNNHV INQAQKISIQ LNDGREFDAK LIGSDDQSDI ALLQIQNPSK LTQIAIADSD KLRVGDFAVA VGNPFGLGQT ATSGIVSALG RSGLNLEGLE NFIQTDASIN RGNSGGALLN LNGELIGINT AILAPGGGSV GIGFAIPSNM ARTLAQQLID FGEIKRGLLG IKGTEMSADI AKAFNLDVQR GAFVSEVLPG SGSAKAGVKA GDIITSLNGK PLNSFAELRS RIATTEPGTK VKLGLLRNGK PLEVEVTLDT STSSSASAEM ITPALEGATL SDGQLKDGGK GIKIDEVVKG SPAAQAGLQK DDVIIGVNRD RVNSIAEMRK VLAAKPAIIA LQIVRGNESI YLLMR
|
| |