Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_3441 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 3689069 |
End bp | 3690493 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | protease Do |
Protein accession | ACX41056 |
Protein GI | 260450634 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CCACATTAGC ACTGAGTGCA CTGGCTCTGA GTTTAGGTTT GGCGTTATCT CCGCTCTCTG CAACGGCGGC TGAGACTTCT TCAGCAACGA CAGCCCAGCA GATGCCAAGC CTTGCACCGA TGCTCGAAAA GGTGATGCCT TCAGTGGTCA GCATTAACGT AGAAGGTAGC ACAACCGTTA ATACGCCGCG TATGCCGCGT AATTTCCAGC AGTTCTTCGG TGATGATTCT CCGTTCTGCC AGGAAGGTTC TCCGTTCCAG AGCTCTCCGT TCTGCCAGGG TGGCCAGGGC GGTAATGGTG GCGGCCAGCA ACAGAAATTC ATGGCGCTGG GTTCCGGCGT CATCATTGAT GCCGATAAAG GCTATGTCGT CACCAACAAC CACGTTGTTG ATAACGCGAC GGTCATTAAA GTTCAACTGA GCGATGGCCG TAAGTTCGAC GCGAAGATGG TTGGCAAAGA TCCGCGCTCT GATATCGCGC TGATCCAAAT CCAGAACCCG AAAAACCTGA CCGCAATTAA GATGGCGGAT TCTGATGCAC TGCGCGTGGG TGATTACACC GTAGCGATTG GTAACCCGTT TGGTCTGGGC GAGACGGTAA CTTCCGGGAT TGTCTCTGCG CTGGGGCGTA GCGGCCTGAA TGCCGAAAAC TACGAAAACT TCATCCAGAC CGATGCAGCG ATCAACCGTG GTAACTCCGG TGGTGCGCTG GTTAACCTGA ACGGCGAACT GATCGGTATC AACACCGCGA TCCTCGCACC GGACGGCGGC AACATCGGTA TCGGTTTTGC TATCCCGAGT AACATGGTGA AAAACCTGAC CTCGCAGATG GTGGAATACG GCCAGGTGAA ACGCGGTGAG CTGGGTATTA TGGGGACTGA GCTGAACTCC GAACTGGCGA AAGCGATGAA AGTTGACGCC CAGCGCGGTG CTTTCGTAAG CCAGGTTCTG CCTAATTCCT CCGCTGCAAA AGCGGGCATT AAAGCGGGTG ATGTGATCAC CTCACTGAAC GGTAAGCCGA TCAGCAGCTT TGCCGCACTG CGTGCTCAGG TGGGTACTAT GCCGGTAGGC AGCAAACTGA CCCTGGGCTT ACTGCGCGAC GGTAAGCAGG TTAACGTGAA CCTGGAACTG CAGCAGAGCA GCCAGAATCA GGTTGATTCC AGCTCCATCT TCAACGGCAT TGAAGGCGCT GAGATGAGCA ACAAAGGCAA AGATCAGGGC GTGGTAGTGA ACAACGTGAA AACGGGCACT CCGGCTGCGC AGATCGGCCT GAAGAAAGGT GATGTGATTA TTGGCGCGAA CCAGCAGGCA GTGAAAAACA TCGCTGAACT GCGTAAAGTT CTCGACAGCA AACCGTCTGT GCTGGCACTC AACATTCAGC GCGGCGACAG CACCATCTAC CTGTTAATGC AGTAA
|
Protein sequence | MKKTTLALSA LALSLGLALS PLSATAAETS SATTAQQMPS LAPMLEKVMP SVVSINVEGS TTVNTPRMPR NFQQFFGDDS PFCQEGSPFQ SSPFCQGGQG GNGGGQQQKF MALGSGVIID ADKGYVVTNN HVVDNATVIK VQLSDGRKFD AKMVGKDPRS DIALIQIQNP KNLTAIKMAD SDALRVGDYT VAIGNPFGLG ETVTSGIVSA LGRSGLNAEN YENFIQTDAA INRGNSGGAL VNLNGELIGI NTAILAPDGG NIGIGFAIPS NMVKNLTSQM VEYGQVKRGE LGIMGTELNS ELAKAMKVDA QRGAFVSQVL PNSSAAKAGI KAGDVITSLN GKPISSFAAL RAQVGTMPVG SKLTLGLLRD GKQVNVNLEL QQSSQNQVDS SSIFNGIEGA EMSNKGKDQG VVVNNVKTGT PAAQIGLKKG DVIIGANQQA VKNIAELRKV LDSKPSVLAL NIQRGDSTIY LLMQ
|
| |