Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_1812 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 1962463 |
End bp | 1964511 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | carboxyl-terminal protease |
Protein accession | ACX39471 |
Protein GI | 260449049 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.412935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATGT TTTTTAGGCT TACCGCGTTA GCTGGCCTGC TTGCAATAGC AGGCCAGACC TTCGCTGTAG AAGATATCAC GCGTGCTGAT CAAATTCCGG TATTAAAGGA AGAGACGCAG CATGCGACGG TAAGTGAGCG CGTAACGTCG CGCTTCACCC GTTCTCATTA TCGCCAGTTC GACCTCGATC AGGCATTTTC GGCCAAAATC TTTGACCGCT ACCTGAATCT GCTCGATTAC AGCCACAACG TGCTGCTGGC AAGCGATGTT GAACAGTTCG CGAAAAAGAA AACCGAGTTA GGCGATGAAC TGCGTTCAGG CAAACTCGAC GTTTTCTACG ATCTCTACAA TCTGGCGCAA AAGCGCCGTT TTGAGCGTTA CCAGTACGCT TTGTCGGTAC TGGAAAAGCC GATGGATTTC ACCGGCAACG ACACTTATAA CCTTGACCGC AGCAAAGCGC CCTGGCCGAA AAACGAGGCT GAGTTGAACG CGCTGTGGGA CAGTAAAGTC AAATTCGACG AGTTAAGCCT GAAGCTGACA GGAAAAACGG ATAAAGAAAT TCGTGAAACC CTGACTCGCC GCTACAAATT TGCCATTCGT CGTCTGGCGC AAACCAACAG CGAAGATGTT TTCTCGCTGG CAATGACGGC GTTTGCGCGT GAAATCGACC CGCATACCAA CTATCTTTCC CCGCGTAATA CCGAACAGTT CAACACTGAA ATGAGTTTGT CGCTGGAAGG TATTGGCGCA GTGCTGCAAA TGGATGATGA CTACACCGTT ATCAATTCGA TGGTGGCAGG TGGTCCGGCA GCGAAGAGTA AAGCTATCAG CGTTGGTGAC AAAATTGTCG GTGTTGGTCA AACAGGCAAG CCGATGGTTG ACGTGATTGG CTGGCGTCTT GATGATGTGG TTGCCTTAAT TAAAGGGCCG AAGGGCAGTA AAGTTCGTCT GGAAATTTTA CCTGCTGGTA AAGGGACCAA GACCCGTACT GTAACGTTGA CCCGTGAACG TATTCGTCTC GAAGACCGCG CGGTTAAAAT GTCGGTGAAG ACCGTCGGTA AAGAGAAAGT CGGCGTGCTG GATATTCCGG GCTTCTATGT GGGTTTGACA GACGATGTCA AAGTGCAACT GCAGAAACTG GAAAAACAGA ATGTCAGCAG CGTCATCATC GACCTGCGTA GCAATGGCGG TGGGGCGTTA ACTGAAGCCG TATCGCTCTC CGGTCTGTTT ATTCCTGCGG GTCCCATTGT TCAGGTCCGC GATAACAACG GCAAGGTTCG TGAAGATAGC GATACCGACG GACAGGTTTT CTATAAAGGC CCGCTGGTGG TGCTGGTTGA CCGCTTCAGT GCTTCGGCTT CAGAAATCTT TGCCGCGGCA ATGCAGGATT ACGGTCGTGC GCTGGTTGTG GGTGAACCGA CGTTTGGTAA AGGCACCGTT CAGCAATACC GTTCATTGAA CCGTATTTAC GATCAGATGT TACGTCCTGA ATGGCCAGCG CTGGGTTCTG TGCAGTACAC GATCCAGAAA TTCTATCGCG TTAACGGCGG CAGTACGCAA CGTAAAGGCG TAACGCCAGA CATCATCATG CCGACGGGTA ATGAAGAAAC GGAAACGGGT GAGAAATTCG AAGATAACGC GCTGCCGTGG GATAGCATTG ATGCCGCGAC TTATGTGAAA TCAGGAGATT TAACGGCCTT TGAACCGGAG CTGCTGAAGG AACATAATGC GCGTATCGCG AAAGATCCTG AGTTCCAGAA CATCATGAAG GATATCGCGC GCTTCAACGC TATGAAGGAC AAGCGCAATA TCGTTTCTCT GAATTACGCT GTGCGTGAGA AAGAGAATAA TGAAGATGAT GCGACGCGTC TGGCGCGTTT GAACGAACGC TTTAAACGCG AAGGTAAACC GGAGTTGAAG AAACTGGATG ATCTACCGAA AGATTACCAG GAGCCGGATC CTTATCTGGA TGAGACGGTG AATATCGCAC TCGATCTGGC GAAGCTTGAA AAAGCCAGAC CCGCGGAACA ACCCGCTCCC GTCAAGTAA
|
Protein sequence | MNMFFRLTAL AGLLAIAGQT FAVEDITRAD QIPVLKEETQ HATVSERVTS RFTRSHYRQF DLDQAFSAKI FDRYLNLLDY SHNVLLASDV EQFAKKKTEL GDELRSGKLD VFYDLYNLAQ KRRFERYQYA LSVLEKPMDF TGNDTYNLDR SKAPWPKNEA ELNALWDSKV KFDELSLKLT GKTDKEIRET LTRRYKFAIR RLAQTNSEDV FSLAMTAFAR EIDPHTNYLS PRNTEQFNTE MSLSLEGIGA VLQMDDDYTV INSMVAGGPA AKSKAISVGD KIVGVGQTGK PMVDVIGWRL DDVVALIKGP KGSKVRLEIL PAGKGTKTRT VTLTRERIRL EDRAVKMSVK TVGKEKVGVL DIPGFYVGLT DDVKVQLQKL EKQNVSSVII DLRSNGGGAL TEAVSLSGLF IPAGPIVQVR DNNGKVREDS DTDGQVFYKG PLVVLVDRFS ASASEIFAAA MQDYGRALVV GEPTFGKGTV QQYRSLNRIY DQMLRPEWPA LGSVQYTIQK FYRVNGGSTQ RKGVTPDIIM PTGNEETETG EKFEDNALPW DSIDAATYVK SGDLTAFEPE LLKEHNARIA KDPEFQNIMK DIARFNAMKD KRNIVSLNYA VREKENNEDD ATRLARLNER FKREGKPELK KLDDLPKDYQ EPDPYLDETV NIALDLAKLE KARPAEQPAP VK
|
| |