Gene Tbd_1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_1789 
Symbol 
ID3671453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp1889047 
End bp1890699 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content56% 
IMG OID637710487 
Productputative alpha-L-arabinofuranosidase 
Protein accessionYP_315547 
Protein GI74317807 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0379328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACAG TTACCGTACG CACCCGTCGC GGAATAAAGT CAGGTTTGAC GGCGACAGCC 
ACACGTTCCG TCCGAACCAG CGCCTCAGCG CTCATATTGT GGGGGGGATT GCATGTGTCT
GCCGACGCAG CGGAGCAAAC GGCTTCGCAG TGGAAGCTGC CGCGCGCTGT TCAGATTCAG
CGGTCACAAG AGGTGCTGAT CGAGGTAGAC CCTAACAGTG TCATCCGAAG TGCCGTTCCC
GCTGCCCTGT TCGGTTTCAA TATTCCTTGG ATGAACTTTC AGCGGGGATA TTGGCGGGAG
AACCAAGTGA GGCCGGAGGT CATTGCGTGG CTCAAACCGT TTTCCGGCGC CGTATATCGG
TACCCCGGCG GGGAGTGGTC AAACTGGTTC GAGTGGGAAA AGGCCGCGGG CCCGGGATCC
TCGAGGCCCG AGCAGTATAC GACCTTCGGA TCCACTAAAG CGGAATTCGG TTTTGATGAG
TTCCTCGACT TTGTAAAAGC CGTGAATGGG GTGCCTCTGG TGACGGTTAA TCTCAAGGGT
ACAAAGGGGG CTCCTTGGAG TGACCAGCAA GCGGTGGAGA GCAATGTTGA ATGGCTCCAA
CATTCTGTTC AGCGTGAGGG AAGAGCGGTT GCACGTGGTA ACGCGCAGTT CTGCCAAACG
GGAACAAAGT GTCCTGTCGA GTGGTGGGAA CTGGGTAACG AGCTCGATTG GGGCAAAGGC
GCTTGGACAC ATACCGAGTA CGTCAACCGG GCGCGGGAAG TCGGGCTCGC GATGAAAAAT
ATCGATCCGG CCATCAAGCT TATCGCTCAC ACAACGAGTT CGCCGTGGAG CACAAAACGC
GTTATCGGGG AAAAGCCGAA GGCTAGGGAT TTCGATAGCG CCGTCGGCGC TGGGTTAGGA
GATCTTGCCT ATGGATATGC CTACCATCCC TACTATGACG GGATCAGTAT TCCCGCTGCA
AATCACTACA TGGAGCGCGC GATTAATTAT TTGCGGCCTA CTGCATCGAC TGGCAAGCCT
CCTCCCATAT TTGTCACGGA GCACGGCCGG TGGCCGAACC AACCCAAGTT CGGCAAGTGG
GAAACCACTT GGATAAAAAC CGGAAATTTA GGCGGGGCAG TCTCGACTGC GGACTTCCTT
CTCTCCCAAA TGGTGATTCC GGATGTACGA GCCGCCATGT GGCATTCGCT CGGGGCGCGA
GGGCCGTGGC AATTGTTCTA TCTTGACGCG GCCGCCGATG CTCTTTATCC AAACGTCGTT
TATTGGGCAT TACGGGTGCT TCGCGAGGGG TTAATTGGCG ACGCTCTAAA GGTGAGCGTT
TCTTCTCCTA ACACAAGCGG TTATCGCGGC GGCTACGATG TCCGTGCCGT ATTCATGCGC
GACAGTACGC ACACACGTTA CAGCTTGCTA ACGATTAATC GCGCTGAGAA AGAGCAGAAA
GCGCGCTTGC TCTTCCCGGA GTGGGCAGGC CGCAAGGTAA AGGGGCAACA GTACTACGTT
AGTGGCGACA GTGATGCGCT CGCGAACACG AAAGAACGCA AAAACGCTGT GACTATGCGG
AGCCGTGCCG TCGGCCTCCG ATTTGACGAC GCAGGTCAGG CCGAGATTAG CCTCCCAGCG
TTTAGTGTGT CGAGTATCGT GATTACGCCT TGA
 
Protein sequence
MQTVTVRTRR GIKSGLTATA TRSVRTSASA LILWGGLHVS ADAAEQTASQ WKLPRAVQIQ 
RSQEVLIEVD PNSVIRSAVP AALFGFNIPW MNFQRGYWRE NQVRPEVIAW LKPFSGAVYR
YPGGEWSNWF EWEKAAGPGS SRPEQYTTFG STKAEFGFDE FLDFVKAVNG VPLVTVNLKG
TKGAPWSDQQ AVESNVEWLQ HSVQREGRAV ARGNAQFCQT GTKCPVEWWE LGNELDWGKG
AWTHTEYVNR AREVGLAMKN IDPAIKLIAH TTSSPWSTKR VIGEKPKARD FDSAVGAGLG
DLAYGYAYHP YYDGISIPAA NHYMERAINY LRPTASTGKP PPIFVTEHGR WPNQPKFGKW
ETTWIKTGNL GGAVSTADFL LSQMVIPDVR AAMWHSLGAR GPWQLFYLDA AADALYPNVV
YWALRVLREG LIGDALKVSV SSPNTSGYRG GYDVRAVFMR DSTHTRYSLL TINRAEKEQK
ARLLFPEWAG RKVKGQQYYV SGDSDALANT KERKNAVTMR SRAVGLRFDD AGQAEISLPA
FSVSSIVITP