Gene Tbd_2091 details


Gene Information

Locus tag: Tbd_2091
Symbol:
ID: 3671993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thiobacillus denitrificans ATCC 25259
Kingdom: Bacteria
Replicon accession: NC_007404
Strand:
Start bp: 2182805
End bp: 2184199
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 67%
IMG OID: 637710794
Product: peptidase S1C, Do
Protein accession: YP_315849
Protein GI: 74318109
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
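
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers agree under that reading) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record; everything else is an illustrative assumption.
START_BP = 2182805
END_BP = 2184199
GENE_LENGTH_BP = 1395
PROTEIN_LENGTH_AA = 464


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1395
print(protein_length(GENE_LENGTH_BP))  # 464
```

Both results match the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.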


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAGAG CGGCTCTGGC CATGACGAGC TTCGTGTTTG TGCTGGCGGC GCCGGTCGCG 
GAAGCGGTCG ACGTCGCCGA TCTGGTCGAA AAGCAGGGTC CGGCGGTGGT CAACATCAGC
ACCACCAAGC TGGTCAAGCG CGGCGCCGAG GGCTTTCCGT TCGCGGTCCC GGAAGACCCC
GAGATGCAGG AATTCTTTCG CCGCTTCTTT CCGGGCGTGC CGGGTCGGGC GCCCGGGGCA
CCGGCGCAGG AGTTTCCGGC GCATGGCGCC GGCTCCGGCT TCATCGTTAG TAGCGACGGC
TACATCCTGA CCAACGCCCA CGTCGTGAAA GGTGCGGACG AAGTCGTCGT CAAGCTGACC
GACAAACGCA AGTTCATCGC CAAGGTCGTG GGTTCGGACC CGCGCACCGA CGTGGCCGTG
ATCCGCATCA CGGCGCGCAA CCTGCCGGCG GTGCGCCTCG GCGACCCGGA AAAACTTCGC
GTCGGTGAGG CAGTGGCTGC GATCGGCTCG CCTTTCGGCT TCGAGAACTC GGTCACCGCG
GGCATCGTGT CGGCCAAGGG CCGCTCGCTG CCGTCCGAAA GCTACGTGCC CTTCATCCAG
ACCGACGTCG CGGTTAACCC GGGTAATTCG GGGGGGCCGC TGTTCAACAT GCGCGGCGAA
GTCGTCGGCA TCAACTCGCA GATCTACAGC CAGAGCGGCG GCTACCAGGG GGTGGCGTTC
GCGATCCCGA TCGACATCGC GATGGAGGTC GTCGACCAGT TGAAGGCTGG CGGCAAGGTC
TCGCGCGGCT GGCTTGGCGT CATGATCCAG GAGGTCAGCG CGGACCTCGC CGAATCCTTC
GGCCTCGACC GGCCGCGCGG CGCGCTCGTG TCGCAGGTAC AGGATGGAAG CCCCGCGGCC
CGTGCGGGCG TCCAGACCGC CGACGTGATC CTCAGCTTCA ACGGCAAGCC GGTCGAGAAT
TCTGGCGACC TGCCGCGCAT CGTCGGCAGC ACCAAGCCCG GGTCGAAGAT CCCGATGCAG
GTCTGGCGGC GCGGCAAAAT GCAGACCCTG CAGGTCGTCC TGGCCGAGCT GCCGAGCGAA
GAGCAGGTCG CCGGCGCGGG CAAGAGCGGC AAGAGCTACT CGCGCGGCGG CCTCGCGCTG
TCCGAACTCA ACCCCGAACA GCGGCGCGAG CTCAAGATCG ACCACGGCCT GCTCGTCGAG
GAAGTCACCG GCGACGCCGC TCGGGCCGGC ATCCGGGTGG GGGACATCGT CCTCGCCGTC
AACAATGCAA GGATCGCGAC CGTCGACGCG TTCCGCCAGG CGATCGCGGC GATCCCGAAA
GGCAAGAGCG CTGCGCTCCT GGTGCGGCGC GGCGAAGGAT CGCTGTACAT CCCGCTGAAG
ATTTCGGGTG AGTAA
 
Protein sequence
MLRAALAMTS FVFVLAAPVA EAVDVADLVE KQGPAVVNIS TTKLVKRGAE GFPFAVPEDP 
EMQEFFRRFF PGVPGRAPGA PAQEFPAHGA GSGFIVSSDG YILTNAHVVK GADEVVVKLT
DKRKFIAKVV GSDPRTDVAV IRITARNLPA VRLGDPEKLR VGEAVAAIGS PFGFENSVTA
GIVSAKGRSL PSESYVPFIQ TDVAVNPGNS GGPLFNMRGE VVGINSQIYS QSGGYQGVAF
AIPIDIAMEV VDQLKAGGKV SRGWLGVMIQ EVSADLAESF GLDRPRGALV SQVQDGSPAA
RAGVQTADVI LSFNGKPVEN SGDLPRIVGS TKPGSKIPMQ VWRRGKMQTL QVVLAELPSE
EQVAGAGKSG KSYSRGGLAL SELNPEQRRE LKIDHGLLVE EVTGDAARAG IRVGDIVLAV
NNARIATVDA FRQAIAAIPK GKSAALLVRR GEGSLYIPLK ISGE
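
The relationship between the gene and protein sequences can be checked with a short translation routine. This is a minimal sketch, not the tool the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it does not handle alternative start codons (not needed here, since the gene begins with ATG). The helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Compact codon table: codons enumerated in T, C, A, G order, last base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence shown in this record.
first_line = "ATGTTGAGAG CGGCTCTGGC CATGACGAGC TTCGTGTTTG TGCTGGCGGC GCCGGTCGCG"
last_line = "ATTTCGGGTG AGTAA"

print(translate(first_line))  # MLRAALAMTSFVFVLAAPVA
print(translate(last_line))   # ISGE
```

The two outputs correspond to the first 20 and last 4 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` gives values to compare against the listed protein and the GC content field.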