Gene Tbd_1172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_1172 
Symbol 
ID3671533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp1223992 
End bp1227279 
Gene Length3288 bp 
Protein Length1095 aa 
Translation table11 
GC content66% 
IMG OID637709856 
Productalpha amylase domain-containing protein 
Protein accessionYP_314930 
Protein GI74317190 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAAA ATGACCCGCT CTGGTACAAG GACGCAATTG TCTACGAACT GCACGTCAAG 
GCGTTCTTCG ACGCCAACGG CGATGGCGCG GGCGACTTCC GGGGCCTGAT CAGCAAGCTC
GACTATCTCC AGGAACTCGG CGTCAACGCG CTCTGGCTGT TGCCGTTCTA CCCGTCTCCG
GGCCGCGACG ACGGCTACGA CATCTCGGAT TACCACAACC TCCACCCCTC GTTCGGCGAC
ATGGCCGACT TCCGCAGGTT CATCCGCGAA GCGCACCAGC GCGGACTGCG TGTCATCACC
GAGCTCGTCG TCAACCACAC CTCCGACCAG CATCCGTGGT TCCAGGCGGC GCGGCGCGCG
CCGCCTGGTT CGGTCAAGCG CAACTATTAC GTGTGGAGCG ATACCGACAC GCGCTTCAGC
GAAACGCGCA TCATCTTTTC GGATACCGAG AAATCGAACT GGGCGTGGGA TGAAGTCGCG
CAGGCCTACT ACTGGCATCG CTTCTTCTCC CACCAGCCCG ATCTCAACTT CAACAACCCG
CACGTCTTCA AGGCCATCAT GCGCACGATG CGCTTCTGGT TCGATGCGGG CGTCGATGGC
ATGCGGCTCG ACGCCGTTCC CTACCTGTGC GAGCGCGACG GCACGAGCAA CGAAAATCTG
CCCGAGACGC ACGCGGTCAT CCGAAGGATG CGCGCCGAAC TCGACGCGCG CTACAGCAAC
CGCATGTTCC TCGCCGAAGC CAACCAGTGG CCCGAGGACG TGCGCGAGTA TTTCGGCAAC
GGCGACGAGT GCCACATGGC CTACCACTTC CCGCTGATGC CGCGCATGTA CATGGCGATC
GCGCAGGAGG ATCGCCACCC GATCGTCGAA ATCATGGAAC AGACACCGGA CATCCCCGAC
CTTTGCCAGT GGGCCGTGTT TCTGCGCAAC CATGACGAAC TGACGCTCGA GATGGTGACC
GACCGCGAAC GCGACTATCT CTATCAGGCC TACGCGAGCG ACCCGCAGGC CCGCCTCAAT
CTCGGCATCC GCCGCCGCCT GGCGCCCTTG CTCGACAACG ACCGCCACCG CATCGAGTTG
ATGAATCTGC TGCTGATGAC GATGCCGGGC TCGCCGATCG TCTATTACGG CGACGAGATC
GGCATGGGCG ACAACCTCCT GCTCGGCGAC CGCAACGGCG TGCGCACGCC GATGCAATGG
GACGGCGGCC CCAACGGCGG ATTCTCGTCC GCGCCGACCG AGCGTCTGTT CCTGCCGCCG
ATCACCGATC CGGTCTACGG CTACGGCGCG GTAAACGTCG AGGCGCAGCA GCGCAATCCC
TCGTCGCTGC TGAACTGGAC CCGGCGGCTG ATCGCAATGC GCAAGGCGCA CCGCGCGCTC
GGTCGCGGCA CGCTGCGCTT CCTGCGGCCT GGCAACCGCA AGGTGCTCGC CTATCTCCGC
GAATACGAGG GCGAGACGAT TCTGTGCGTG GCCAACGTCG CGCGTGCGCC GCAGGCCGTC
GAACTCGATC TCTCGCCCTT CAAGGGACAC GTGCCGGTCG AACTGATGGG GCGCAGCAGC
TTCCCGCCGA TCGGCGAGTT GCCCTATCTG CTGACGCTCG GCGGATACGG CTGTTTCGTG
TTCCGGCTCG CGACCGACGT CGAGGCGCCG GCCTGGCACG AAGAAAGGCC GGTGCCGCCC
GATCTGCCCG TGCTCGTTTT GGTCGACGCG GGCTGGCGCA CGCTCTTCGC GCGGACCGAC
GAAGGCGTGA ACCAGCTCAT GGTGCGGCGT GCACGCGAGC AGCTCGAGCG CCAGATCATT
CCGCGCTACT TCCGGACGCA GCCGTGGTTC GTCTACAGCG AGGCCGCGCT CGAGAAATTC
GAGTTCGGGA CCTTGCGCGA GTGGGGGACC GACAGCGGAA CCTGGCTGCT CGCGACCGTC
ACCGTCACGC TCCTCGACGG CAGCATTTAT CACTACGCGC TGCCGCTGGG CCTCGCCTGG
GAAGACGAAG ACGAAGGCCG CGTCGCCGCC CTGCTGCACG TGACGCTCGC CAAGGTGCGG
CGGCTGGCGC GGGTCGGCAT CCTGTTCGAC GCCTTCTGGG ACGACGGCTT CTGTTGCGCG
ATCGTCTCCG CCATGGAGCG CGGCGAGACG CTTGCCTACG GCGACGGCCA CCTGACGTTC
CGCACGGCCA GCGCCTACCC CGGATTCTTC TGTCCGCTTG TCACGTCGTC GATCACCCGT
ACGGTCTCCG AACACGGGCG TCTGCGCGTG AATCTCAACG ACCAGCTGGT CCTGAAAAGC
TATCCGCGAC AAATGCAGGG CACGCATCCC GAGCTCGAGA TGTCCCGGTT CCTCACCGAG
ACCGCGAAGT TTGCGCACAT CCCGCAACTC GGCGGCACGG TCGAATACGT CGCGAGCAGC
GGCAGGCACT CCACGCTCGC GATCCTCGAG CGCTATGCGC CCAACCAGGG CGACGCCTGG
GCCTACACGC TCAACTACCT CGAACGTTTT CTCGATCTCA GCCGCACGAC GGGCGAGCAG
GCGCCCGACG GCCGGCACGG TCGCTACATG GGGCTGATGA AGACGCTCGG CGAGCGCACG
GCCGAGTTCC ATCGCGCGCT CGCGACCCCG GATCGCTCGG GCGACTTCGG CAGCGAGCCG
ATCGCGCCGC CGGATATTCT GGAGTGGGTC AACAAGGTAC GGCACGAGAT GGGCGTGATG
TACGAGCTGC TCGAACGTGC CTTGCCGGCT TTGCCGGAGC CCGTGGCCGT CGCCGCGCAG
CAGCTTTTTC TCGTTCGCCC CAAGCTCTAC CGGCGCGTCA TCCGCGCCTC GCGCGTGCGT
GTCGACGCGA GCAAGACGCG CTGCCACGGA AACTATCACC TCGGTCAGGT CTGGCTCGTG
CAAAACGATT TCCTGATCGC GAATTACGGC GGCATACCCG GCCGCAGCTG GGAAGAACGC
CGCGCAAAGC ACGCGCCGCT GCGCGACGTC GCTAGCATGC TGCTTTCGCT GTCGCAGGCG
GGCGCGGCCG CGCTCGCGCG TGTCGCCGGC GACTCCGTCG ATGTGATGGC GGCCCTGCAG
CCCCATGTCG ACGCATGGGA GCTGGCGGCG CGCAAGGCGT TCTACCGCGG TTACCGCAAG
GGCATGGACG GTCATGCCGC GTACCCGACG GACGCGACCG CGGCCGAGGC CTTGCTGACT
TTGTTCCTGG CTGAAAAAGC GATTGCGGAG CTGACCGAGG CGCTCGAACG CCGCGCCGTC
GGCAGCGCTG CTGCAATGCG TCGGCTGGTT CAGGTGACAC GCCGTTGA
 
Protein sequence
MEENDPLWYK DAIVYELHVK AFFDANGDGA GDFRGLISKL DYLQELGVNA LWLLPFYPSP 
GRDDGYDISD YHNLHPSFGD MADFRRFIRE AHQRGLRVIT ELVVNHTSDQ HPWFQAARRA
PPGSVKRNYY VWSDTDTRFS ETRIIFSDTE KSNWAWDEVA QAYYWHRFFS HQPDLNFNNP
HVFKAIMRTM RFWFDAGVDG MRLDAVPYLC ERDGTSNENL PETHAVIRRM RAELDARYSN
RMFLAEANQW PEDVREYFGN GDECHMAYHF PLMPRMYMAI AQEDRHPIVE IMEQTPDIPD
LCQWAVFLRN HDELTLEMVT DRERDYLYQA YASDPQARLN LGIRRRLAPL LDNDRHRIEL
MNLLLMTMPG SPIVYYGDEI GMGDNLLLGD RNGVRTPMQW DGGPNGGFSS APTERLFLPP
ITDPVYGYGA VNVEAQQRNP SSLLNWTRRL IAMRKAHRAL GRGTLRFLRP GNRKVLAYLR
EYEGETILCV ANVARAPQAV ELDLSPFKGH VPVELMGRSS FPPIGELPYL LTLGGYGCFV
FRLATDVEAP AWHEERPVPP DLPVLVLVDA GWRTLFARTD EGVNQLMVRR AREQLERQII
PRYFRTQPWF VYSEAALEKF EFGTLREWGT DSGTWLLATV TVTLLDGSIY HYALPLGLAW
EDEDEGRVAA LLHVTLAKVR RLARVGILFD AFWDDGFCCA IVSAMERGET LAYGDGHLTF
RTASAYPGFF CPLVTSSITR TVSEHGRLRV NLNDQLVLKS YPRQMQGTHP ELEMSRFLTE
TAKFAHIPQL GGTVEYVASS GRHSTLAILE RYAPNQGDAW AYTLNYLERF LDLSRTTGEQ
APDGRHGRYM GLMKTLGERT AEFHRALATP DRSGDFGSEP IAPPDILEWV NKVRHEMGVM
YELLERALPA LPEPVAVAAQ QLFLVRPKLY RRVIRASRVR VDASKTRCHG NYHLGQVWLV
QNDFLIANYG GIPGRSWEER RAKHAPLRDV ASMLLSLSQA GAAALARVAG DSVDVMAALQ
PHVDAWELAA RKAFYRGYRK GMDGHAAYPT DATAAEALLT LFLAEKAIAE LTEALERRAV
GSAAAMRRLV QVTRR