Gene Information |
Locus tag | Tbd_1172 |
Symbol | |
ID | 3671533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | + |
Start bp | 1223992 |
End bp | 1227279 |
Gene Length | 3288 bp |
Protein Length | 1095 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637709856 |
Product | alpha amylase domain-containing protein |
Protein accession | YP_314930 |
Protein GI | 74317190 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases; [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID | [TIGR02456] trehalose synthase; [TIGR02457] trehalose synthase-fused probable maltokinase
|
|
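The coordinate and length fields in the Gene Information table are arithmetically related, and a quick consistency check can be sketched as below. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table; names are illustrative.
start_bp = 1223992
end_bp = 1227279


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3288 1095
```

Both results agree with the Gene Length (3288 bp) and Protein Length (1095 aa) fields listed in the table.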
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGAAA ATGACCCGCT CTGGTACAAG GACGCAATTG TCTACGAACT GCACGTCAAG GCGTTCTTCG ACGCCAACGG CGATGGCGCG GGCGACTTCC GGGGCCTGAT CAGCAAGCTC GACTATCTCC AGGAACTCGG CGTCAACGCG CTCTGGCTGT TGCCGTTCTA CCCGTCTCCG GGCCGCGACG ACGGCTACGA CATCTCGGAT TACCACAACC TCCACCCCTC GTTCGGCGAC ATGGCCGACT TCCGCAGGTT CATCCGCGAA GCGCACCAGC GCGGACTGCG TGTCATCACC GAGCTCGTCG TCAACCACAC CTCCGACCAG CATCCGTGGT TCCAGGCGGC GCGGCGCGCG CCGCCTGGTT CGGTCAAGCG CAACTATTAC GTGTGGAGCG ATACCGACAC GCGCTTCAGC GAAACGCGCA TCATCTTTTC GGATACCGAG AAATCGAACT GGGCGTGGGA TGAAGTCGCG CAGGCCTACT ACTGGCATCG CTTCTTCTCC CACCAGCCCG ATCTCAACTT CAACAACCCG CACGTCTTCA AGGCCATCAT GCGCACGATG CGCTTCTGGT TCGATGCGGG CGTCGATGGC ATGCGGCTCG ACGCCGTTCC CTACCTGTGC GAGCGCGACG GCACGAGCAA CGAAAATCTG CCCGAGACGC ACGCGGTCAT CCGAAGGATG CGCGCCGAAC TCGACGCGCG CTACAGCAAC CGCATGTTCC TCGCCGAAGC CAACCAGTGG CCCGAGGACG TGCGCGAGTA TTTCGGCAAC GGCGACGAGT GCCACATGGC CTACCACTTC CCGCTGATGC CGCGCATGTA CATGGCGATC GCGCAGGAGG ATCGCCACCC GATCGTCGAA ATCATGGAAC AGACACCGGA CATCCCCGAC CTTTGCCAGT GGGCCGTGTT TCTGCGCAAC CATGACGAAC TGACGCTCGA GATGGTGACC GACCGCGAAC GCGACTATCT CTATCAGGCC TACGCGAGCG ACCCGCAGGC CCGCCTCAAT CTCGGCATCC GCCGCCGCCT GGCGCCCTTG CTCGACAACG ACCGCCACCG CATCGAGTTG ATGAATCTGC TGCTGATGAC GATGCCGGGC TCGCCGATCG TCTATTACGG CGACGAGATC GGCATGGGCG ACAACCTCCT GCTCGGCGAC CGCAACGGCG TGCGCACGCC GATGCAATGG GACGGCGGCC CCAACGGCGG ATTCTCGTCC GCGCCGACCG AGCGTCTGTT CCTGCCGCCG ATCACCGATC CGGTCTACGG CTACGGCGCG GTAAACGTCG AGGCGCAGCA GCGCAATCCC TCGTCGCTGC TGAACTGGAC CCGGCGGCTG ATCGCAATGC GCAAGGCGCA CCGCGCGCTC GGTCGCGGCA CGCTGCGCTT CCTGCGGCCT GGCAACCGCA AGGTGCTCGC CTATCTCCGC GAATACGAGG GCGAGACGAT TCTGTGCGTG GCCAACGTCG CGCGTGCGCC GCAGGCCGTC GAACTCGATC TCTCGCCCTT CAAGGGACAC GTGCCGGTCG AACTGATGGG GCGCAGCAGC TTCCCGCCGA TCGGCGAGTT GCCCTATCTG CTGACGCTCG GCGGATACGG CTGTTTCGTG TTCCGGCTCG CGACCGACGT CGAGGCGCCG GCCTGGCACG AAGAAAGGCC GGTGCCGCCC GATCTGCCCG TGCTCGTTTT GGTCGACGCG GGCTGGCGCA CGCTCTTCGC GCGGACCGAC GAAGGCGTGA ACCAGCTCAT GGTGCGGCGT GCACGCGAGC AGCTCGAGCG CCAGATCATT 
CCGCGCTACT TCCGGACGCA GCCGTGGTTC GTCTACAGCG AGGCCGCGCT CGAGAAATTC GAGTTCGGGA CCTTGCGCGA GTGGGGGACC GACAGCGGAA CCTGGCTGCT CGCGACCGTC ACCGTCACGC TCCTCGACGG CAGCATTTAT CACTACGCGC TGCCGCTGGG CCTCGCCTGG GAAGACGAAG ACGAAGGCCG CGTCGCCGCC CTGCTGCACG TGACGCTCGC CAAGGTGCGG CGGCTGGCGC GGGTCGGCAT CCTGTTCGAC GCCTTCTGGG ACGACGGCTT CTGTTGCGCG ATCGTCTCCG CCATGGAGCG CGGCGAGACG CTTGCCTACG GCGACGGCCA CCTGACGTTC CGCACGGCCA GCGCCTACCC CGGATTCTTC TGTCCGCTTG TCACGTCGTC GATCACCCGT ACGGTCTCCG AACACGGGCG TCTGCGCGTG AATCTCAACG ACCAGCTGGT CCTGAAAAGC TATCCGCGAC AAATGCAGGG CACGCATCCC GAGCTCGAGA TGTCCCGGTT CCTCACCGAG ACCGCGAAGT TTGCGCACAT CCCGCAACTC GGCGGCACGG TCGAATACGT CGCGAGCAGC GGCAGGCACT CCACGCTCGC GATCCTCGAG CGCTATGCGC CCAACCAGGG CGACGCCTGG GCCTACACGC TCAACTACCT CGAACGTTTT CTCGATCTCA GCCGCACGAC GGGCGAGCAG GCGCCCGACG GCCGGCACGG TCGCTACATG GGGCTGATGA AGACGCTCGG CGAGCGCACG GCCGAGTTCC ATCGCGCGCT CGCGACCCCG GATCGCTCGG GCGACTTCGG CAGCGAGCCG ATCGCGCCGC CGGATATTCT GGAGTGGGTC AACAAGGTAC GGCACGAGAT GGGCGTGATG TACGAGCTGC TCGAACGTGC CTTGCCGGCT TTGCCGGAGC CCGTGGCCGT CGCCGCGCAG CAGCTTTTTC TCGTTCGCCC CAAGCTCTAC CGGCGCGTCA TCCGCGCCTC GCGCGTGCGT GTCGACGCGA GCAAGACGCG CTGCCACGGA AACTATCACC TCGGTCAGGT CTGGCTCGTG CAAAACGATT TCCTGATCGC GAATTACGGC GGCATACCCG GCCGCAGCTG GGAAGAACGC CGCGCAAAGC ACGCGCCGCT GCGCGACGTC GCTAGCATGC TGCTTTCGCT GTCGCAGGCG GGCGCGGCCG CGCTCGCGCG TGTCGCCGGC GACTCCGTCG ATGTGATGGC GGCCCTGCAG CCCCATGTCG ACGCATGGGA GCTGGCGGCG CGCAAGGCGT TCTACCGCGG TTACCGCAAG GGCATGGACG GTCATGCCGC GTACCCGACG GACGCGACCG CGGCCGAGGC CTTGCTGACT TTGTTCCTGG CTGAAAAAGC GATTGCGGAG CTGACCGAGG CGCTCGAACG CCGCGCCGTC GGCAGCGCTG CTGCAATGCG TCGGCTGGTT CAGGTGACAC GCCGTTGA
|
Protein sequence | MEENDPLWYK DAIVYELHVK AFFDANGDGA GDFRGLISKL DYLQELGVNA LWLLPFYPSP GRDDGYDISD YHNLHPSFGD MADFRRFIRE AHQRGLRVIT ELVVNHTSDQ HPWFQAARRA PPGSVKRNYY VWSDTDTRFS ETRIIFSDTE KSNWAWDEVA QAYYWHRFFS HQPDLNFNNP HVFKAIMRTM RFWFDAGVDG MRLDAVPYLC ERDGTSNENL PETHAVIRRM RAELDARYSN RMFLAEANQW PEDVREYFGN GDECHMAYHF PLMPRMYMAI AQEDRHPIVE IMEQTPDIPD LCQWAVFLRN HDELTLEMVT DRERDYLYQA YASDPQARLN LGIRRRLAPL LDNDRHRIEL MNLLLMTMPG SPIVYYGDEI GMGDNLLLGD RNGVRTPMQW DGGPNGGFSS APTERLFLPP ITDPVYGYGA VNVEAQQRNP SSLLNWTRRL IAMRKAHRAL GRGTLRFLRP GNRKVLAYLR EYEGETILCV ANVARAPQAV ELDLSPFKGH VPVELMGRSS FPPIGELPYL LTLGGYGCFV FRLATDVEAP AWHEERPVPP DLPVLVLVDA GWRTLFARTD EGVNQLMVRR AREQLERQII PRYFRTQPWF VYSEAALEKF EFGTLREWGT DSGTWLLATV TVTLLDGSIY HYALPLGLAW EDEDEGRVAA LLHVTLAKVR RLARVGILFD AFWDDGFCCA IVSAMERGET LAYGDGHLTF RTASAYPGFF CPLVTSSITR TVSEHGRLRV NLNDQLVLKS YPRQMQGTHP ELEMSRFLTE TAKFAHIPQL GGTVEYVASS GRHSTLAILE RYAPNQGDAW AYTLNYLERF LDLSRTTGEQ APDGRHGRYM GLMKTLGERT AEFHRALATP DRSGDFGSEP IAPPDILEWV NKVRHEMGVM YELLERALPA LPEPVAVAAQ QLFLVRPKLY RRVIRASRVR VDASKTRCHG NYHLGQVWLV QNDFLIANYG GIPGRSWEER RAKHAPLRDV ASMLLSLSQA GAAALARVAG DSVDVMAALQ PHVDAWELAA RKAFYRGYRK GMDGHAAYPT DATAAEALLT LFLAEKAIAE LTEALERRAV GSAAAMRRLV QVTRR
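The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this on the first 30 and last 18 bases of the gene sequence only. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (the tables differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the record.

```python
# Compact encoding of the standard codon assignments, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# Fragments copied from the start and end of the gene sequence.
first_codons = "ATGGAAGAAA ATGACCCGCT CTGGTACAAG"
last_codons = "CAGGTGACAC GCCGTTGA"

print(translate(first_codons))  # MEENDPLWYK
print(translate(last_codons))   # QVTRR
```

The two outputs match the first ten and last five residues of the protein sequence given in the record.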