Gene Bind_3014 details


Gene Information

Locus tag: Bind_3014
Symbol:
ID: 6198238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 3430561
End bp: 3433845
Gene Length: 3285 bp
Protein Length: 1094 aa
Translation table: 11
GC content: 57%
IMG OID: 641706962
Product: trehalose synthase
Protein accession: YP_001834065
Protein GI: 182679919
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
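
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a protein length that excludes the terminal stop codon. Both are assumptions that happen to reproduce the listed values, not conventions stated by this record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3430561, 3433845)
print(length)                           # 3285
print(expected_protein_length(length))  # 1094
```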


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.602191
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGATC GGAATGACGT ATCCTGGTAT CGCGACGCCA TCATCTATCA GCTCCATGTG 
AAATCCTTTT ACGATCACGA CAATGATGGC ATGGGCGATT TCAAGGGCCT GACCGCCAAG
CTCGACTACA TCAAGGATCT CGGGGCGACG GCGATCTGGT TAATGCCCTT CTATCCCTCT
CCCCTGCGCG ACGACGGTTA TGACATTTCG GATTATCGCA CGATCAATCC TGCCTATGGC
GGGCTGCGCG ACTTCCGACG TTTCGTGCGC GAGGCCCATG AACGCGATCT GCGCGTGATC
ACGGAACTCG TCATCAATCA TACATCGGAT CAGCATCCCT GGTTCCAGCG CGCGCGGGCG
GCAAAACCCG GTTCCGCCGC GCGCAATTTC TATGTCTTCG CCGAAAACGA TCACCGCTAC
AAAGATGCGG GCATTGTCTT TCTCGATACT GAGAAATCCA ACTGGACCTG GGACGACGAG
GCAAAAGCTT TCTACTGGCA TCGTTTCTAT GCACATCAGC CGGATCTTAA TTTCGACAAT
CCGCGCGTCA TCGAGGCCGT GCTCGACATT ATGGTGTTCT GGCTCGATAT GGGGGTCGAT
GGATTACGGC TCGATGCCAT TCCCTATCTC ATCGAACGGG AAGGCACGAA TTGCGAAAAT
CTGCCTGAGA CCCATGCCAT CATCAAAAAA ATCCGTGCCG CTCTCGATGC CCGTTACAGC
GATCGCATGC TGCTCGCCGA AGCCAATCTC TGGCCAGAGG AGACCGCACA ATATTTTGGC
GATGGCGATG AATGCCACAT GGCCTTTCAT TTCCCACTCA TGCCAAGAAT CTACATGGCC
CTCGCTCAGG AAGACCGGCA CCCCATCACC GACATCATGC GGCAGACCCC CGATATTCCT
GAAAGCGCGC AATGGGCGAT CTTCCTGCGC AATCATGACG AAATGACACT TGCCATGGTC
ACCGACAAGG AGCGTGACTA TCTCTGGTCC TTCTATGCGG CAGACCAGCG TGCCCGCATC
AACCTCGGCA TCAGGCGGCG GCTCGCCCCT CTGCTCGGCA ATGATCGCCG CAAGATCGAA
CTGTTGAACT CGCTGCTTCA TTCCATGCCG GGCACACCCG TGCTCTATTA TGGCGACGAG
ATCGGTATGG GCGACAATAT CTATCTCGGC GATCGCGATG GCGTGCGCAC GCCAATGCAA
TGGTCGGTCG ATCGCAATGG CGGCTTTAGC CGCGTCGATC CGGCCAAATT GTTCCTGCCG
GCCATCCAGG ACCCAATCTA TGGCTATAGC GCCATCAATG TCGAGGCGCA GCTTGCGAGC
ACAACCAGCC TGTTGAACTG GACGCGGCGC CTGATTGCCG TGCGTCGTTC CCATGCCGCC
TTTGGACGCG GCAGCTTGCG TTTCCTGTAT CCATCGAACC GCCGGGTGCT TGCCTATCTG
CGCCTCCATG AGGGCGAGAC ACTTCTCTGC GTCGTCAATG TATCGGGCTC CGCACAGGCC
GTGGAACTCG ATCTCGCCGA ATTCAAAAAT GCCGTTCCGG TCGAGTTGAT GGGCGGCGTT
CTGTTTCCAC CGATCGGCAC CACCCCCTAT CTGCTGACAC TGGCCGCTTA TGGTTTTTTC
TGGTTCCGCC TGGAACCGGT GGAGGCGTTG CGCGATCAGA TGAAGGTACG GCCGACGCCT
GAATTATTTA CCCTCGTCGC GACCGGGAAA CTGGAGACCA TTCTGGCCGG ACGTGAATTG
GCGGCTTTCG AGCGCACTGT CGCACCGCGT TTTCTCGTCT CGCGGCCCTG GTTCGATAAT
AGCGAGCGGA AGATCAGTTC CGTCAACGTG CGGGATTTCG CGATTCTGCG CAATGTTCAC
CAGGGCCGTT TCATTCTGCC GCTCCTCGAT GTCGATTTCG ATAATGGCTC CGCGCAAGCC
TATTTCACGC CCTTTGCCGC CGAAGCTGGC ACGGAGGACG AAAACGCGCT GACCCATGGT
GTCGCCGTGC TGCGTCGGGC GGCCGAGACT GGTCTGCTCT ATGACGCCGA TGTCAATCCC
GCTTTTGCGA TTGCGATGAT CGATGCCTTG CGGAGCGAGG CTGTCATGAC AACACCAAAG
AGCAATCGTT TCGTCTTCCG GCCGACCAAA CTCCTCGCCG ACCGCCTCGA CGCGGAAGCC
GATCTTACGG CGCTCTCCGT ACATCGGCTA GAGACTGAAG ACGACAAGAT TGCGCTGGCC
CTCTCCAATC GGCTGATCCT CAAGATCAAT CACCGTCTTC ACGAGGGAGT AGAACCGGAA
ATCGAGACCA GTCTGTTTTT GTCCAGCACG GCGCTTTTCA CCAATATGCC CGCCTTGCTC
GGCACGATCG AATATGTCGA CAAGGAAGGA AATCGGACCG CCCTAGCAAT TCTGCAGAGC
TTCATTCGCA GTCAGGGGGA CGCCTGGCGG TGGACACTGC ATGCCTTGAA GCGTGTGCTT
GAAACACAGG CACTGGCGCC GACCTCGACA CAATCGGAAA CCGCACCACC TGATAGTTTT
GCGACCTATG TTCCGCATAT GCGCCGGCTT GGGCAACGCA CCGCCGAATT ACACAAGGCC
CTGGCAGTGG AGACAGACGA TCAGGCTTTC GCGGCCGAAC CCTTGACTTT CAAGGATGTC
GAGGCAGCGG CCGCGCGTGC ACGCTCTCTC GCCGAGCGCG CTTTCACCCA TCTCGAACGT
CTCAAGCGGC AAGCAAGTGT CACAGAAGAA ACCAGCAACG CTTGCATGGC TCTCCTGCAT
CGCCAACAGG ACTGTTTTGC GCTGATCGAA CGCCTGACGC AGCCTCCCGT GGGGGCGATT
AAGATCCGCA TTCACGGCGA TTATCATCTC GGTCAATTAC TGGTCGTGAA GGACGATGTC
GTCATCCTCA ATTTTGAGGC CAATTCGGCC CGGACTATCG CAGAGCGACG CACCAAAGCC
TCGCCCTTGC TCGATGTCGC CGATATGGTA CGTTCCTTCG CTTATGCGGT GGAAACCGCG
CGGCGTGATC TGGCAAGACA ACTTCCCGGC ACCACAGTCG CAAGCGAACT TGCGAAGGAA
CGCCTCCGTT TCTCCCGGGT CTTCATCGAT GCCTATATGG AAGCGGCCAC CGATAGCGCG
ATCTGGATCA AGGACCAACC GACCCGCATA CGCCTGTTAC GTCTGTTTTT CCTGTCGAAG
GCTCTCTCTG CGCTCGATCA CGAAGCCCTC AACCGACCCG ACTGGATCGC TCTTCCGATC
GAAGGCGTCC TGCTCCTGCT CGACGAGACC AGTGAGCTCG CGTGA
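
The gene sequence above is printed in space-separated blocks of ten bases, so it needs light cleaning before any calculation. The following is a minimal sketch, not the database's own method: it joins the blocks, computes the GC fraction, and checks for a terminal stop codon. The helper names are hypothetical, and the stop codons used (TAA, TAG, TGA) are those of translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon of translation table 11."""
    return seq[-3:] in {"TAA", "TAG", "TGA"}


# First two blocks of the listing, used only as a small illustration.
sample = clean_sequence("ATGATCGATC GGAATGACGT")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.45

# Final two blocks of the listing.
print(ends_with_stop(clean_sequence("AGTGAGCTCG CGTGA")))  # True
```

Passing the full listing through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content reported under Gene Information.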
 
Protein sequence
MIDRNDVSWY RDAIIYQLHV KSFYDHDNDG MGDFKGLTAK LDYIKDLGAT AIWLMPFYPS 
PLRDDGYDIS DYRTINPAYG GLRDFRRFVR EAHERDLRVI TELVINHTSD QHPWFQRARA
AKPGSAARNF YVFAENDHRY KDAGIVFLDT EKSNWTWDDE AKAFYWHRFY AHQPDLNFDN
PRVIEAVLDI MVFWLDMGVD GLRLDAIPYL IEREGTNCEN LPETHAIIKK IRAALDARYS
DRMLLAEANL WPEETAQYFG DGDECHMAFH FPLMPRIYMA LAQEDRHPIT DIMRQTPDIP
ESAQWAIFLR NHDEMTLAMV TDKERDYLWS FYAADQRARI NLGIRRRLAP LLGNDRRKIE
LLNSLLHSMP GTPVLYYGDE IGMGDNIYLG DRDGVRTPMQ WSVDRNGGFS RVDPAKLFLP
AIQDPIYGYS AINVEAQLAS TTSLLNWTRR LIAVRRSHAA FGRGSLRFLY PSNRRVLAYL
RLHEGETLLC VVNVSGSAQA VELDLAEFKN AVPVELMGGV LFPPIGTTPY LLTLAAYGFF
WFRLEPVEAL RDQMKVRPTP ELFTLVATGK LETILAGREL AAFERTVAPR FLVSRPWFDN
SERKISSVNV RDFAILRNVH QGRFILPLLD VDFDNGSAQA YFTPFAAEAG TEDENALTHG
VAVLRRAAET GLLYDADVNP AFAIAMIDAL RSEAVMTTPK SNRFVFRPTK LLADRLDAEA
DLTALSVHRL ETEDDKIALA LSNRLILKIN HRLHEGVEPE IETSLFLSST ALFTNMPALL
GTIEYVDKEG NRTALAILQS FIRSQGDAWR WTLHALKRVL ETQALAPTST QSETAPPDSF
ATYVPHMRRL GQRTAELHKA LAVETDDQAF AAEPLTFKDV EAAAARARSL AERAFTHLER
LKRQASVTEE TSNACMALLH RQQDCFALIE RLTQPPVGAI KIRIHGDYHL GQLLVVKDDV
VILNFEANSA RTIAERRTKA SPLLDVADMV RSFAYAVETA RRDLARQLPG TTVASELAKE
RLRFSRVFID AYMEAATDSA IWIKDQPTRI RLLRLFFLSK ALSALDHEAL NRPDWIALPI
EGVLLLLDET SELA