Gene Tbd_1033 details


Gene Information

Locus tag: Tbd_1033
Symbol:
ID: 3671258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thiobacillus denitrificans ATCC 25259
Kingdom: Bacteria
Replicon accession: NC_007404
Strand:
Start bp: 1097808
End bp: 1099811
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 67%
IMG OID: 637709716
Product: collagenase
Protein accession: YP_314791
Protein GI: 74317051
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
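
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1097808
end_bp = 1099811
gene_length_bp = 2004
protein_length_aa = 667


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# 1099811 - 1097808 + 1 = 2004 bp, and 2004 / 3 - 1 = 667 aa.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the reported gene length and protein length agree with the coordinates.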


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.771528
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0721423
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTTTT GCCGCATGGA ACGCCCAGTG GACCCGCATC AACTCGAACT GCTCGCGCCT 
GCGCGCGACG CCGATATCGG TATAGAGGCC GTCAATCACG GCGCCGACGC GGTCTACATC
GGCGGCCCCG CGTTCGGCGC GCGGGCGGCA GCGTGCAACG ATATCGCCGA CATCGCACGG
CTCGTCGCAC ACGCCCATCG CTTCAACGCG CGCGTGTTCG CCACGCTCAA CACCATTCTC
GCCGACGACG AACTCGAGCC GGCACGTCGC CAGATCTGGC AGCTTTACGA AGCCGGCGTC
GACGCGCTGA TCATCCAGGA CATGGGACTC CTGCAGCTCG ACCTGCCGCC GATCCAGCTG
CACGCGAGCA CCCAGTGCGA CATCCGCACG CCGGAGAAGG CGCGCTTCCT GCAGGATGTC
GGGCTGTCGC AACTCGTGCT CGCACGCGAA CTCGATCTCC GGGAGATCGC CGCGATCCGC
GCCGCGACCG ATCCGGCGAC CACGCTCGAA TTCTTCGTCC ACGGCGCGCT GTGCGTGGCC
TACAGCGGGC AGTGCTACAT CAGCCATGCC CACACCGGGC GCAGCGCCAA TCGCGGGGAC
TGCTCGCAGG CCTGCCGCCT GCCGTATCAG GTCACCGACA TGGAGGGCCG CATCGTCGCG
CACGACAAAC ACGTGCTGTC GATGAAGGAC AACAATCAGA GTCGGAACCT CGAGGCGCTC
GTCGACGCCG GCATCCGCAG CTTCAAGATC GAGGGGCGCT ACAAGGACAT GGGCTACGTG
AAGAACATCA CCGCCCACTA CCGCGTTCTG CTCGACGAGA TCCTCGAACG GCGCCCGGAA
TTCGCCCGCG CCTCGTCCGG GCGTACGTCG TTCAGCTTCA CGCCGGATCC CAACCAGAAC
TTCAATCGCG AGTTCACCGA CTACTTCGTG GAAGGCCGCA AGGCCGACAT CGGTGCGTTC
GACACGCCCA AGAATCCCGG CTTGCCGATC GGTCACGTCG TCAAGGTCGC TGACAAATGG
CTCGAGGTGC GGGTCGATGC GGCCGATCTC GTGCTCAACA ACGGCGACGG GCTGTGCTAC
TACACGCTGC AGAAGGAGTT GGCCGGTCTC GCCATCAACC GCGCCGAAAA AGCGGGCGAG
GGCCTGTGGC GCGTGTTCCC CAAGGATCCG ATCGCAGGCT TCAAGGACTT GCGCGCCGGA
ACCCCGGTCA ACCGCAACCG CGACATGAAC TGGACGCGCC TGCTCGCCCG ACCGTCGAGC
GAACGCCGCA TCGCGGTCTG GCTGCGCTTT GGGGAAACGG CGGATGGCTT CGCGCTGACA
TTGACCGACG AGGACGGCCA CACCGCGACC GTCTCAGCGC CTCACGCCAA GGAGGCGGCA
AAGGACGCCG CGAAAAGCGA GCCGATGCTC CGCGAGCAAC TCGGAAAACT CGGCGGGACG
CCTTTCGCCG CGACCGGCAT TGCGCTCGAA CTGAGCGCGT CCTGGTTTCT CCCCCCGTCG
TTTCTCAATG CCCTGCGCCG CGACGCCGTC TCGGCGCTCG AGACCGCGCG CGCCCGCGCC
TACGAACGCC CGCCGCGAGC GCGACCGATC GAGCCGCCGG TCGTCTACCC GGAGGACACG
CTCTCGTATC TCGCCAATGT CGCCAACAGC AAAGCGCGCG ACTTTTACCT CAGGCACGGC
GTCCGGGTCG TCGCCGCCGC TTATGAAGGG CACGCGGAGA CGGGCGAGGT GTCGCTGATG
ATCACGCGCC ACTGCGTGCG CTATTCGCTG TCGCTGTGCC CGAAGCAGGC CAAGGGCGTG
ACTGGCGTGC AGGGGACGGT GCGCGCCGCG CCGCTGACCC TCGTCAACGG CAGCGAAAAG
CTCACCCTGC GCTTCGACTG CAAGCGCTGC GAGATGCACG TGCTCGGAAA GCTCAAGCCC
GCGGTGGCGC GGCAGACGGC GCCGCTGACC TTTTATCGCT CGCGTCCTTC GGCGGCAGGC
GACAACGCGG CGACCGGCAC GTGA
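
The gene sequence is printed in space-separated blocks of ten bases, so whitespace has to be removed before computing on it. The following is a minimal sketch of how the length and GC fraction could be recomputed from such a block; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the short strings in the example are placeholders rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Placeholder input: the first two blocks of the gene sequence.
first_blocks = "ATGGCGTTTT GCCGCATGGA"
print(len(clean_sequence(first_blocks)))  # number of bases after cleaning
print(gc_fraction("GGCC AATT"))           # toy input, half G/C
```

Pasting the complete gene sequence into `gc_fraction` gives a value that can be compared with the GC content field of the record.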
 
Protein sequence
MAFCRMERPV DPHQLELLAP ARDADIGIEA VNHGADAVYI GGPAFGARAA ACNDIADIAR 
LVAHAHRFNA RVFATLNTIL ADDELEPARR QIWQLYEAGV DALIIQDMGL LQLDLPPIQL
HASTQCDIRT PEKARFLQDV GLSQLVLARE LDLREIAAIR AATDPATTLE FFVHGALCVA
YSGQCYISHA HTGRSANRGD CSQACRLPYQ VTDMEGRIVA HDKHVLSMKD NNQSRNLEAL
VDAGIRSFKI EGRYKDMGYV KNITAHYRVL LDEILERRPE FARASSGRTS FSFTPDPNQN
FNREFTDYFV EGRKADIGAF DTPKNPGLPI GHVVKVADKW LEVRVDAADL VLNNGDGLCY
YTLQKELAGL AINRAEKAGE GLWRVFPKDP IAGFKDLRAG TPVNRNRDMN WTRLLARPSS
ERRIAVWLRF GETADGFALT LTDEDGHTAT VSAPHAKEAA KDAAKSEPML REQLGKLGGT
PFAATGIALE LSASWFLPPS FLNALRRDAV SALETARARA YERPPRARPI EPPVVYPEDT
LSYLANVANS KARDFYLRHG VRVVAAAYEG HAETGEVSLM ITRHCVRYSL SLCPKQAKGV
TGVQGTVRAA PLTLVNGSEK LTLRFDCKRC EMHVLGKLKP AVARQTAPLT FYRSRPSAAG
DNAATGT
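
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The `translate` helper is illustrative; it reads from the first base and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 0 up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate("ATGGCGTTTT GCCGCATGGA A"))  # prints MAFCRME
# Last three codons of the gene sequence, ending in the TGA stop codon.
print(translate("GGCACGTGA"))                # prints GT
```

Both fragments match the start (MAFCRME) and the end (GT) of the protein sequence listed above.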