Gene Information |
Locus tag | Tbd_1033 |
Symbol | |
ID | 3671258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | + |
Start bp | 1097808 |
End bp | 1099811 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637709716 |
Product | collagenase |
Protein accession | YP_314791 |
Protein GI | 74317051 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
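
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 2004 bp) and that the CDS ends in a single stop codon, which is not counted in the protein length. The function names `span_length` and `coding_length_aa` are hypothetical and introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 1097808
END_BP = 1099811
GENE_LENGTH_BP = 2004
PROTEIN_LENGTH_AA = 667


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the terminal stop and encodes no residue.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 1099811 - 1097808 + 1 = 2004 bp, and 2004 / 3 - 1 = 667 aa.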
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.771528 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0721423 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTTTT GCCGCATGGA ACGCCCAGTG GACCCGCATC AACTCGAACT GCTCGCGCCT GCGCGCGACG CCGATATCGG TATAGAGGCC GTCAATCACG GCGCCGACGC GGTCTACATC GGCGGCCCCG CGTTCGGCGC GCGGGCGGCA GCGTGCAACG ATATCGCCGA CATCGCACGG CTCGTCGCAC ACGCCCATCG CTTCAACGCG CGCGTGTTCG CCACGCTCAA CACCATTCTC GCCGACGACG AACTCGAGCC GGCACGTCGC CAGATCTGGC AGCTTTACGA AGCCGGCGTC GACGCGCTGA TCATCCAGGA CATGGGACTC CTGCAGCTCG ACCTGCCGCC GATCCAGCTG CACGCGAGCA CCCAGTGCGA CATCCGCACG CCGGAGAAGG CGCGCTTCCT GCAGGATGTC GGGCTGTCGC AACTCGTGCT CGCACGCGAA CTCGATCTCC GGGAGATCGC CGCGATCCGC GCCGCGACCG ATCCGGCGAC CACGCTCGAA TTCTTCGTCC ACGGCGCGCT GTGCGTGGCC TACAGCGGGC AGTGCTACAT CAGCCATGCC CACACCGGGC GCAGCGCCAA TCGCGGGGAC TGCTCGCAGG CCTGCCGCCT GCCGTATCAG GTCACCGACA TGGAGGGCCG CATCGTCGCG CACGACAAAC ACGTGCTGTC GATGAAGGAC AACAATCAGA GTCGGAACCT CGAGGCGCTC GTCGACGCCG GCATCCGCAG CTTCAAGATC GAGGGGCGCT ACAAGGACAT GGGCTACGTG AAGAACATCA CCGCCCACTA CCGCGTTCTG CTCGACGAGA TCCTCGAACG GCGCCCGGAA TTCGCCCGCG CCTCGTCCGG GCGTACGTCG TTCAGCTTCA CGCCGGATCC CAACCAGAAC TTCAATCGCG AGTTCACCGA CTACTTCGTG GAAGGCCGCA AGGCCGACAT CGGTGCGTTC GACACGCCCA AGAATCCCGG CTTGCCGATC GGTCACGTCG TCAAGGTCGC TGACAAATGG CTCGAGGTGC GGGTCGATGC GGCCGATCTC GTGCTCAACA ACGGCGACGG GCTGTGCTAC TACACGCTGC AGAAGGAGTT GGCCGGTCTC GCCATCAACC GCGCCGAAAA AGCGGGCGAG GGCCTGTGGC GCGTGTTCCC CAAGGATCCG ATCGCAGGCT TCAAGGACTT GCGCGCCGGA ACCCCGGTCA ACCGCAACCG CGACATGAAC TGGACGCGCC TGCTCGCCCG ACCGTCGAGC GAACGCCGCA TCGCGGTCTG GCTGCGCTTT GGGGAAACGG CGGATGGCTT CGCGCTGACA TTGACCGACG AGGACGGCCA CACCGCGACC GTCTCAGCGC CTCACGCCAA GGAGGCGGCA AAGGACGCCG CGAAAAGCGA GCCGATGCTC CGCGAGCAAC TCGGAAAACT CGGCGGGACG CCTTTCGCCG CGACCGGCAT TGCGCTCGAA CTGAGCGCGT CCTGGTTTCT CCCCCCGTCG TTTCTCAATG CCCTGCGCCG CGACGCCGTC TCGGCGCTCG AGACCGCGCG CGCCCGCGCC TACGAACGCC CGCCGCGAGC GCGACCGATC GAGCCGCCGG TCGTCTACCC GGAGGACACG CTCTCGTATC TCGCCAATGT CGCCAACAGC AAAGCGCGCG ACTTTTACCT CAGGCACGGC GTCCGGGTCG TCGCCGCCGC TTATGAAGGG CACGCGGAGA CGGGCGAGGT GTCGCTGATG ATCACGCGCC ACTGCGTGCG CTATTCGCTG TCGCTGTGCC CGAAGCAGGC CAAGGGCGTG 
ACTGGCGTGC AGGGGACGGT GCGCGCCGCG CCGCTGACCC TCGTCAACGG CAGCGAAAAG CTCACCCTGC GCTTCGACTG CAAGCGCTGC GAGATGCACG TGCTCGGAAA GCTCAAGCCC GCGGTGGCGC GGCAGACGGC GCCGCTGACC TTTTATCGCT CGCGTCCTTC GGCGGCAGGC GACAACGCGG CGACCGGCAC GTGA
|
Protein sequence | MAFCRMERPV DPHQLELLAP ARDADIGIEA VNHGADAVYI GGPAFGARAA ACNDIADIAR LVAHAHRFNA RVFATLNTIL ADDELEPARR QIWQLYEAGV DALIIQDMGL LQLDLPPIQL HASTQCDIRT PEKARFLQDV GLSQLVLARE LDLREIAAIR AATDPATTLE FFVHGALCVA YSGQCYISHA HTGRSANRGD CSQACRLPYQ VTDMEGRIVA HDKHVLSMKD NNQSRNLEAL VDAGIRSFKI EGRYKDMGYV KNITAHYRVL LDEILERRPE FARASSGRTS FSFTPDPNQN FNREFTDYFV EGRKADIGAF DTPKNPGLPI GHVVKVADKW LEVRVDAADL VLNNGDGLCY YTLQKELAGL AINRAEKAGE GLWRVFPKDP IAGFKDLRAG TPVNRNRDMN WTRLLARPSS ERRIAVWLRF GETADGFALT LTDEDGHTAT VSAPHAKEAA KDAAKSEPML REQLGKLGGT PFAATGIALE LSASWFLPPS FLNALRRDAV SALETARARA YERPPRARPI EPPVVYPEDT LSYLANVANS KARDFYLRHG VRVVAAAYEG HAETGEVSLM ITRHCVRYSL SLCPKQAKGV TGVQGTVRAA PLTLVNGSEK LTLRFDCKRC EMHVLGKLKP AVARQTAPLT FYRSRPSAAG DNAATGT
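
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: it builds the codon-to-amino-acid mapping shared by the standard code and translation table 11, strips the spacing used in the record, and translates until the first stop codon. It does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
# Amino-acid assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in the record.
    print(translate("ATGGCGTTTT GCCGCATGGA ACGCCCAGTG"))
    # Last 24 bases of the gene sequence, ending in the TGA stop codon.
    print(translate("GACAACGCGG CGACCGGCAC GTGA"))
```

Applied to the first 30 bases of the record, `translate` returns `MAFCRMERPV`, matching the first block of the protein sequence; applied to the final 24 bases it returns `DNAATGT`, matching the protein's C-terminus. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported 67% GC content.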