Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_1983 |
Symbol | |
ID | 5113399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 2155943 |
End bp | 2157907 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640492171 |
Product | peptidase U32 |
Protein accession | YP_001176710 |
Protein GI | 146311636 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.156367 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTGC AATCCCATCA TCTTGAACTT TTAAGCCCGG CTCGCGACGC CTCCATCGCC CGTGAAGCGA TTCTTCACGG TGCGGACGCG GTCTATATCG GCGGCCCTGG CTTTGGCGCT CGCCATAACG CCAGCAACAG CCTGCAGGAT ATTGCGGAGC TGGTGCCGTT TGCCCACCGT TTTGGTGCAA AAGTGTTCGT GACCCTGAAC ACCATTCTTC ATGATGATGA GCTTGAACCC GCCCAACGAC TGATTACCGA CCTGTATCAG ACTGGAGTCG ATGCGCTGAT CGTTCAGGAC ATGGGCGTGC TCGAGCTGGA TATTCCGCCG ATTGAACTGC ATGCCAGTAC CCAGTGCGAT ATTCGTACCG TTGAAAAAGC GAAGTTTCTG TCTGACGTAG GCTTTACCCA GATCGTTCTG GCGCGCGAGC TGAATCTGAA TCAAATCCGC GACATTCACC AGGCCACTGA CGCCAACATC GAATTCTTCA TTCACGGCGC GCTGTGTGTG GCGTATTCCG GCCAGTGCAA TATTTCCCAT GCGCAGACCG GGCGCAGCGC CAACCGTGGC GATTGCTCGC AAGCGTGTCG TTTGCCTTAC ACGCTGAAAG ACGATCAGGG CCGCGTCGTG GCGTTCGAAA AACATCTGCT GTCGATGAAA GACAATGATC AGACGGCAAA CCTGGCGGCG CTCATCGACG CTGGCGTGCG CTCCTTCAAA ATTGAAGGGC GCTACAAAGA CATGAGCTAC GTGAAGAACA TCACCGCGCA TTATCGCCAG ATGCTTGACG CCATTATTGA AGATCGTGGC GACCTGGCGC GCTCGTCTGC TGGCCGCACC GAGCATTTCT TCATTCCGTC GACGGATAAA ACGTTCCACC GCGGCAGCAC GGATTACTTT GTGAATGCGC GTAAAGACGA TATCGGTGCG TTTGATTCGC CGAAATTTAT CGGCCTGCCG GTGGGTGAAG TGTTAAAAGT ATCCAAAGAT TATCTGGACG TAAAAGTGAC CGAAACGCTG GCTAACGGTG ACGGGCTGAA CGTGATGATC AAACGCGAAA TCGTCGGTTT CCGCGCCAAT ACCGTCGAGA AAACGGGTGA GAATCAGTAT CGCGTCTGGC CGAACGAAAT GCCTGCGGAT CTGTACAAAG CCCGCCCGAA TGCTGCGCTT AACCGTAACC TCGACCATAA CTGGCAGCAG GCGCTGTTGA AAACCTCCAG TGAACGTCGT ATTGCGGTGG ATATGGAGCT GGGTGGTTGG GAAGAACAGC TGATCCTGAC CATGACCAGT GAAGATGGCG TGAGCGTGAC CCATACCCTG GACGGTCAGT TTGAGGTGGC GAATAACGCA GAGAAGGCGA TGAACAGCCT GAAAGACGGC GTGGCGAAGC TGGGACAAAC GATCTATTAC GCCCGCGACA TTACGCTAAC GCTGCCGGAC GCACTGTTCG TGCCGAACAG TCAGCTTAAC CAGTTCCGCC GCGAAACCGC AGAAATGCTT GATGAGGCGC GCTTGGCCAA TTACCCGCGC GGGAGCCGCA AAGCGGTGTC TGTCCCTGCG CCGGTTTATC CGGATTCTCA TTTGTCATTC CTGGCGAACG TGTACAACCA CAAAGCACGC GAGTTTTATC ATCGTTACGG CGTGCAATTA ATTGATGCAG CTTATGAGGC GCACGAAGAG AAGGGCGATG TGCCGGTGAT GATCACCAAG CACTGTTTGC GCTTCGCCTT TAACCTGTGC CCGAAACAGG CGAAGGGCAA CATCAAAAGC TGGAAGGCCA CACCTATGCA GTTGGTGAAT GGTGATGAAG TGTTAACGTT GAAATTTGAC TGCCGTCCCT GCGAAATGCA CGTGATTGGC AAAATGAAAA ATCACATCTT CAAAATGCCA CAACCGGGAA GCGTTGTGGC CTCTGTTAGC CCCGAAGATC TGATGAAAAC CCTGCCGAAG CGCAAGGGCG TTTAA
|
Protein sequence | MRLQSHHLEL LSPARDASIA REAILHGADA VYIGGPGFGA RHNASNSLQD IAELVPFAHR FGAKVFVTLN TILHDDELEP AQRLITDLYQ TGVDALIVQD MGVLELDIPP IELHASTQCD IRTVEKAKFL SDVGFTQIVL ARELNLNQIR DIHQATDANI EFFIHGALCV AYSGQCNISH AQTGRSANRG DCSQACRLPY TLKDDQGRVV AFEKHLLSMK DNDQTANLAA LIDAGVRSFK IEGRYKDMSY VKNITAHYRQ MLDAIIEDRG DLARSSAGRT EHFFIPSTDK TFHRGSTDYF VNARKDDIGA FDSPKFIGLP VGEVLKVSKD YLDVKVTETL ANGDGLNVMI KREIVGFRAN TVEKTGENQY RVWPNEMPAD LYKARPNAAL NRNLDHNWQQ ALLKTSSERR IAVDMELGGW EEQLILTMTS EDGVSVTHTL DGQFEVANNA EKAMNSLKDG VAKLGQTIYY ARDITLTLPD ALFVPNSQLN QFRRETAEML DEARLANYPR GSRKAVSVPA PVYPDSHLSF LANVYNHKAR EFYHRYGVQL IDAAYEAHEE KGDVPVMITK HCLRFAFNLC PKQAKGNIKS WKATPMQLVN GDEVLTLKFD CRPCEMHVIG KMKNHIFKMP QPGSVVASVS PEDLMKTLPK RKGV
|
| |