Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2224 |
Symbol | |
ID | 6064980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2442599 |
End bp | 2444602 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641601630 |
Product | peptidase U32 |
Protein accession | YP_001725189 |
Protein GI | 170020235 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.787396 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTAAAA TAGCCGCCAT TTTTCAGCTA CTGGATAAGA ATGTGACCGT ATCTTCTCAT CGACTTGAAC TGTTAAGCCC GGCACGCGAT GCCGCCATTG CCCGCGAAGC TATTTTGCAC GGTGCCGATG CTGTTTATAT CGGCGGCCCT GGTTTTGGTG CCCGTCATAA TGCCAGTAAT AGCTTGAAAG ATATTGCCGA GCTGGTGCCG TTTGCCCATC GTTATGGTGC AAAAATTTTC GTCACGCTTA ACACCATTTT GCATGATGAT GAGCTGGAAC CCGCGCAACG GCTGATTACT GACCTCTACC AGACCGGTGT CGATGCGCTG ATTGTTCAGG ATATGGGGAT TCTGGAACTT GATATTCCGC CGATTGAACT GCACGCCAGT ACGCAGTGCG ACATTCGTAC AGTTGAAAAA GCGAAGTTCC TCTCTGATGT TGGCTTCACG CAGATTGTGC TGGCGCGAGA GCTGAATCTT GATCAGATCC GCGCGATTCA CCAGGCTACG GACGCGACCA TTGAATTCTT TATTCATGGG GCACTGTGCG TGGCCTATTC GGGTCAGTGC TACATTTCTC ATGCGCAAAC AGGGCGTAGC GCCAACCGTG GCGATTGCTC GCAGGCGTGC CGTTTGCCAT ACACATTGAA AGACGATCAG GGGCGGGTGG TTTCCTATGA AAAACATCTG CTGTCGATGA AAGATAACGA TCAGACTGCC AACCTCGGCG CGCTGATTGA TGCTGGTGTA CGCTCCTTCA AGATTGAAGG GCGTTACAAA GATATGAGCT ACGTGAAGAA TATCACCGCC CATTATCGCC AGATGCTTGA TGCCATTATT GAAGAACGTG GCGATCTGGC GCGCGCTTCA TCAGGTCGTA CTGAACATTT CTTTGTTCCA TCGACGGAAA AGACTTTCCA CCGTGGTAGC ACAGATTATT TTGTGAATGC CCGTAAAGGC GATATTGGCG CGTTCGATTC GCCGAAATTT ATCGGCCTGC CGGTAGGCGA AGTATTGAAA GTGGCGAAAG ATCATCTCGA TGTTGCCGTT ACCGAGCCAC TGGCAAATGG CGATGGCCTG AACGTGTTGA TTAAACGTGA AGTCGTCGGT TTTCGTGCCA ATACGGTCGA GAAAACCGGA GAAAATCAGT ACCGCGTCTG GCCCAATGAA ATGCCAGCAG ATTTGCACAA AATTCGTCCA CATCACCCAC TAAACCGTAA TCTTGATCAT AACTGGCAGC AGGCACTGAC AAAAACCTCC AGCGAACGTC GGGTGGCGGT AGACATTGAA CTGGGCGGCT GGCAGGAACA ACTGATTCTG ACCCTCACCA GTGAAGAGGG TGTCAGCATC ACGCATACGC TGGACGGGCA GTTCGACGAA GCCAATAACG CCGAAAAAGC AATGAACAAT CTGAAGGATG GTCTGGCAAA ACTGGGGCAA ACCCTCTATT ACGCCCGCGA TGTGCAAATT AATTTGCCGG GGGCGCTGTT TGTACCAAAC AGTCTGTTAA ACCAGTTCCG CCGTGAAGCT GCTGACATGC TGGATGCTGC GCGTCTTGCC AGTTACCAGC GCGGCAGCCG TAAACCGGTT GCTGATCCTG CGCCGGTTTA TCCGCAAACG CATCTGAGTT TCCTCGCGAA CGTATACAAC CAGAAAGCGC GTGAATTTTA TCATCGCTAT GGTGTGCAGC TGATTGACGC GGCGTATGAA GCACATGAAG AGAAGGGCGA AGTCCCGGTG ATGATCACCA AGCATTGTCT GCGCTTTGCC TTTAATCTGT GCCCGAAACA GGCGAAAGGC AATATCAAAA GCTGGAAGGC GACGCCAATG CAACTGGTTA ACGGCGATGA AGTATTAACG CTAAAGTTTG ATTGCCGCCC ATGCGAGATG CACGTCATTG GCAAAATCAA AAATCACATA CTGAAAATGC CGTTACCGGG AAGCGTAGTG GCATCCGTAA GTCCGGATGA GCTGCTGAAA ACATTGCCTA AGCGAAAAGG GTAA
|
Protein sequence | MAKIAAIFQL LDKNVTVSSH RLELLSPARD AAIAREAILH GADAVYIGGP GFGARHNASN SLKDIAELVP FAHRYGAKIF VTLNTILHDD ELEPAQRLIT DLYQTGVDAL IVQDMGILEL DIPPIELHAS TQCDIRTVEK AKFLSDVGFT QIVLARELNL DQIRAIHQAT DATIEFFIHG ALCVAYSGQC YISHAQTGRS ANRGDCSQAC RLPYTLKDDQ GRVVSYEKHL LSMKDNDQTA NLGALIDAGV RSFKIEGRYK DMSYVKNITA HYRQMLDAII EERGDLARAS SGRTEHFFVP STEKTFHRGS TDYFVNARKG DIGAFDSPKF IGLPVGEVLK VAKDHLDVAV TEPLANGDGL NVLIKREVVG FRANTVEKTG ENQYRVWPNE MPADLHKIRP HHPLNRNLDH NWQQALTKTS SERRVAVDIE LGGWQEQLIL TLTSEEGVSI THTLDGQFDE ANNAEKAMNN LKDGLAKLGQ TLYYARDVQI NLPGALFVPN SLLNQFRREA ADMLDAARLA SYQRGSRKPV ADPAPVYPQT HLSFLANVYN QKAREFYHRY GVQLIDAAYE AHEEKGEVPV MITKHCLRFA FNLCPKQAKG NIKSWKATPM QLVNGDEVLT LKFDCRPCEM HVIGKIKNHI LKMPLPGSVV ASVSPDELLK TLPKRKG
|
| |