Gene Tbd_0100 details

Gene Information

Locus tag: Tbd_0100
Symbol:
ID: 3673878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thiobacillus denitrificans ATCC 25259
Kingdom: Bacteria
Replicon accession: NC_007404
Strand:
Start bp: 106695
End bp: 108335
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 65%
IMG OID: 637708760
Product: signal transduction protein
Protein accession: YP_313858
Protein GI: 74316118
COG category: [T] Signal transduction mechanisms
COG ID: [COG1639] Predicted signal transduction protein
TIGRFAM ID:
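
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 106695
end_bp = 108335

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, plus one terminal stop codon
# that does not appear in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1641 bp
print(protein_length, "aa")   # 546 aa
```

Both results agree with the Gene Length and Protein Length fields, and the gene length is an exact multiple of three, as expected for an unspliced CDS.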


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.114046
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.288231
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCTCCCG CTGACCCGCA CCCCGCCGAA GAAACGGCGC TGGACGATCC CGGCCTGCCG 
TTTCGCCAGA GCGCGCTCGA CATCCTGCTG CGCCGCATGC GCAGCGAAAG CGACTTCCCT
GCGCTGTCCG AGGCGATCGG TGCGATCAAC CGCATCGCCG CGTCGGATCG CGAGGGGGTC
AACGAGCTGT CGAACAACAT CCTCAAGGAC TTCGCGCTCA CCAACAAGCT CTTGCGGCTC
GCCAACGTCG CGTTCTACAA CCAGGTCGGC GGCGGCTCGA TCAGCACGAT CTCGCGTGCG
GTCGTGATCC TCGGTTTCGA CGCCGTGCGC TCGATCGCGC TGAGCCTCAT CCTGTTCGAC
AACCTCGAAA ACAAGGCGCA CGCACAGCAG CTCAAGGAAG AGTTCGTCAA GCTGCTCTAC
GCCGGGATGC TGGCGCGCGA AATGGCGGGC AAGGCGCAGG TGGCCGACGT CGAAGAGGCG
TTCATCGGCG CGATGCTGCA CAAGCTCGGG CGCATGCTGG CGATGTTCTA CTTTCCCGCG
GAAACCGCGC AGATCGAGGA ACGGATCGCG GCCGAGGGGC TCGGCGACAG CCGGGTGTCG
AGCGAAGTGC TCGGCGTCTC GTTCGAAAAC CTCGGCATCG GCATCGCGCG CAGCTGGGGT
TTCCCCGATC AGCTCGTGCA GAGCATGAAG AAACTTCCGG AAGGCAAGCT CAGGCGAAGC
ACCGCCGGCG CCGACCGCCT GCGCGCGCTC GCCGGGTTTT CCAATGCCTT GTGTGAAGCC
ATCCTCGACA CCCCCGACAG CGAACGCGGC AAGGCGCTGG CGAAGATCAC CGGGCGTTTC
AGCGACGTCG TGCCGATCGG CGTCGAACAA CTCGCCGAGG TGATGGAAAA GTCGATGCAC
GACTTCGCGC AATTCGCCCT CGCCGTGAAT GTGAACCTCA AGCAGAGCGA TTTCGCCCAG
CAGGCGTCGA AATGGGCCGG CGTGCGCATG CCGGCGGCGT CCACCGATCC GTCCGCCAGC
GCCGACGACC GCGCCGCGCT CGAGTCGACG ATGCTGCACG AGCACGCGCC GATACTCGAC
GACGCGTCCG CGGCCGTGCC CGAGAGTGCA CCGCGCAGCA GCGCTGAAAT CCAGGCCGCG
CTCAGCTCGG GGCTGCAGGA CGTCGGCAAT TCCCTGATCG ACGACAACGT GTCGATCAAC
GACATCCTGC GCATGATTCT CGAGGCGATG TATACCGGCA TGGGCTTCGA CCACGTCGTG
CTGTGCATCA AGGATGGGCG TCGCAATGCG ATGTGCGGCA AGTTCGGTTT CGGCGACGGC
GTGCAGGACC TGATCAGGGC CTTCGACTTT CCGCTCACGG CGCCGGCGGA CGTCTTTCTG
GTCGCCCTGC AGCAGAACGC GGACATCCTC ATTACCGACA TCGACGACGC CAAGATCGCG
ACGCGCATCC CCGCCTGGTA TCGCGCGCGC GTCGCGGCAC ACACCTTCGC GCTCTTTCCG
ATCATCGTCC GCGGCAAGGC CGTGGGGCTG ATTTACGCCG ACCGCGCGCG CCCCGGCGAC
ATCACGATCC CGGAAAAGGA ATTGTCGCTG CTGAAGTCCC TGCGCAACCA GGCCGTACTC
GCCATCCGCC AGTCGGTGTA G
 
Protein sequence
MPPADPHPAE ETALDDPGLP FRQSALDILL RRMRSESDFP ALSEAIGAIN RIAASDREGV 
NELSNNILKD FALTNKLLRL ANVAFYNQVG GGSISTISRA VVILGFDAVR SIALSLILFD
NLENKAHAQQ LKEEFVKLLY AGMLAREMAG KAQVADVEEA FIGAMLHKLG RMLAMFYFPA
ETAQIEERIA AEGLGDSRVS SEVLGVSFEN LGIGIARSWG FPDQLVQSMK KLPEGKLRRS
TAGADRLRAL AGFSNALCEA ILDTPDSERG KALAKITGRF SDVVPIGVEQ LAEVMEKSMH
DFAQFALAVN VNLKQSDFAQ QASKWAGVRM PAASTDPSAS ADDRAALEST MLHEHAPILD
DASAAVPESA PRSSAEIQAA LSSGLQDVGN SLIDDNVSIN DILRMILEAM YTGMGFDHVV
LCIKDGRRNA MCGKFGFGDG VQDLIRAFDF PLTAPADVFL VALQQNADIL ITDIDDAKIA
TRIPAWYRAR VAAHTFALFP IIVRGKAVGL IYADRARPGD ITIPEKELSL LKSLRNQAVL
AIRQSV
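
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. Under translation table 11 (the bacterial code), TTG is an accepted alternative start codon and is read as methionine when it initiates translation. The sketch below illustrates this on the first 30 bases of the gene sequence. It is a minimal illustration, not a full translator: `CODONS` is a hand-written partial table covering only the codons in that prefix, and `translate_prefix` is a hypothetical helper that assumes the first codon is the initiator.

```python
# Partial standard codon table: only the codons found in the first 30 bases.
CODONS = {
    "TTG": "L", "CCT": "P", "CCC": "P", "GCT": "A", "GAC": "D",
    "CCG": "P", "CAC": "H", "GCC": "A", "GAA": "E",
}

def translate_prefix(dna):
    """Translate a short coding prefix, reading the first codon as Met."""
    dna = dna.replace(" ", "").upper()
    residues = []
    # Step through complete codons only; ignore any trailing partial codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODONS[dna[i:i + 3]]
        if i == 0:
            # Assumption: the first codon is the initiator, which is
            # read as methionine even though TTG is otherwise Leu.
            amino_acid = "M"
        residues.append(amino_acid)
    return "".join(residues)

# First three 10-base groups of the gene sequence.
prefix = "TTGCCTCCCG CTGACCCGCA CCCCGCCGAA"
print(translate_prefix(prefix))  # MPPADPHPAE
```

The output matches the first ten residues of the protein sequence.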