Gene Tbd_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_0040 
Symbol 
ID3673247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp40581 
End bp42122 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content68% 
IMG OID637708699 
Producthypothetical protein 
Protein accessionYP_313798 
Protein GI74316058 
COG category[R] General function prediction only 
COG ID[COG4784] Putative Zn-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.257159 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAAG CGGCTTCGGC CGCTTTTTGC ATTGTTGCGG CAAGCCAGCG TTTATCATTG 
CGTTCCGGAC GAATAGATGA GGCAGATGTG ACACCCCGAG GGGCTTTCCG CCGTATTGCC
ATGATCCTGG CCGGCGCGCT GGCGCTGAGT CATTGCGCGC AGAATCCCGT GAGCGGCGAG
CGCGACTTCG TGCTGCTGTC CGAGCAGCAG GAAGCCGAAA TGGGCGCGCA GGCGCATCGC
GACGTGCTGA AGGAATACGC GGCGCTCGAC GCGCCCGAAC TGCAGGCCTA TGTCGACGCG
GTCGGCCAGC GTCTCGCGAA GCAAAGCCAT CGCCCGGGGC TGACCTGGCA CTTCACGGTC
ATCGACAGCC CCGACGTCAA CGCCTTCGCA TTGCCCGGCG GCTATGTGTA CGTCACACGC
GGCATCCTGG CCTATCTGAA TTCGGAAGCC GAACTTGCCG GCGTCGTCGG CCACGAGATC
GGCCACGTCA CCGCGCGCCA CGGCGTGCGC CAGCAAAGCG CCGCCACGGC CGCCGGCCTC
GGCACCGTGC TCGGATCGAT CCTCGTGCCG GGACTGGACA ACCAGGCGGG CGCCTCACTG
TTGCAGACGC TCGCGCAGGC CTGGACCGCC GGGTATGGCC GCGAACACGA ACTCGAGGCC
GACCGGCTCG GCGCGCAATA CCTCGCGAAA ACCGGCTACC GGCCCGAGGC GATGATTGAC
GTCATCGGCG TGCTCAAGAA CCAGGAACGC TTTGCCGCCG AGAGGGCCAA GCGTGACGGC
ACCAAGCCGC GCACCTACCA CGGCACCTTC GATACGCACC CGAGCAACGA CAAGCGTCTG
CAGCAGGTGG TGAACGAGGC GAAGCGCTAT CGGGTCGCGG CGCCGCGCGA GGGGCGCAGC
GAGTATCTCG AGAAGATCGC CGGCGTCTAC TTCGGCGACA GCCCCGAGCA GGGGCTCGTC
CGCGACAACC TGCTGGTGCA CGAGAAACTC GGCCTGGCGA TGCAATTTCC GCCCGCGTGG
CATGTGCAGA ACCATCCCGA CCGGGTGGCA GCGACGAGCC CCGGCGGCGA CGCGATGATC
GAGATCCTGG CCGGGCCGCG GAACGCGCGA CCGCTCGACA CGCTGAAGAA GGGCATCCGG
CTCGATCCCG GCGCGCGCTA CGACAGCGGC AACCTCGGCG GCTTCCCCGC GGCGTTTGCC
GCCGGTGCCC AGCAGGGTAG GCCGGTCGTC GTCGCCGCCG TGGTGTTCAA GGACAGGCAG
TACCTGATCG CCGGCATGAC GCGGGACAAA ACCGCCTACC AGAAGCAACG CGGTACCCTG
CGCGCGGCGA TCAACAGCTT CCGCGAGACG ACCGGCGCCG ACAGGGCGCG TGCGCGTCCC
TATCGCCTGA AGCTCGTGAC GGCCAAGCAC GGCACGACGA TGGCCGAAGT CGCGCGGCAG
AGTCCGCTCG GCGCCGACGG CGAGAGCCAG TTGCGCCTCA TGAACGACCT CTATCCCGGC
GGCGAGCCCA AGGCGGGCCA GCGCCTCAAA GTTGTCGACT GA
 
Protein sequence
MLKAASAAFC IVAASQRLSL RSGRIDEADV TPRGAFRRIA MILAGALALS HCAQNPVSGE 
RDFVLLSEQQ EAEMGAQAHR DVLKEYAALD APELQAYVDA VGQRLAKQSH RPGLTWHFTV
IDSPDVNAFA LPGGYVYVTR GILAYLNSEA ELAGVVGHEI GHVTARHGVR QQSAATAAGL
GTVLGSILVP GLDNQAGASL LQTLAQAWTA GYGREHELEA DRLGAQYLAK TGYRPEAMID
VIGVLKNQER FAAERAKRDG TKPRTYHGTF DTHPSNDKRL QQVVNEAKRY RVAAPREGRS
EYLEKIAGVY FGDSPEQGLV RDNLLVHEKL GLAMQFPPAW HVQNHPDRVA ATSPGGDAMI
EILAGPRNAR PLDTLKKGIR LDPGARYDSG NLGGFPAAFA AGAQQGRPVV VAAVVFKDRQ
YLIAGMTRDK TAYQKQRGTL RAAINSFRET TGADRARARP YRLKLVTAKH GTTMAEVARQ
SPLGADGESQ LRLMNDLYPG GEPKAGQRLK VVD