Gene Bind_1805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1805 
Symbol 
ID6201524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2047940 
End bp2050165 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content59% 
IMG OID641705795 
Productglucosyltransferase MdoH 
Protein accessionYP_001832922 
Protein GI182678776 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.710428 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCC TGAACACATT CACCGACCAC GCTCATCTGG CCGATCAAAC GTTGTGGCAA 
GATCTCGAGG CTGCCGAGGG AAAAGAGGCG ATCCTTTCTT CTCCCTTCGC GGCAACGCCG
CCAGAGATGC CCCTCACCAT GCCGGCACAA AACCTTGGTC GCTATGATCG GTCTCAACGC
CGTTCCTGGC AAAAGAACCG GCCCCAGTTC TGGGTTTGGC TCGCGCGCCT CATCGTTTTC
GGCGGCGGCC TTGCCTTAAC CGCCTATGGC GCCTGGCAAA TGTATCAAGT GGTGAGCCTC
GGCGGCGTGA CGCCGCTGGA ATGGGTCCTG CTCGTTCTGT TCGTCGCCAA TTTCTCTTGG
ATTGCGCTCG CCTTCACCGG GAGCATCGTC GGCTTTATCT GGCTCTTCGT GCGTCCGCCG
GCCGATACGC CGACGCCAAA AACGTTGCGT GAAAAGACCG TCGTCGTGAT GCCGATCTAC
AATGAGGCAC CGGCCCGTGT TTTCGGCGCC ATGCAAGCAA TTTTCGAGGA TGTCGAAGCC
ACCGGGCTGG GTCAGGCCTT CGACTGGTTC TTCCTGTCAG ATACGACCGA CCCCGATATT
TTCATCGCCG AAGAACAGGC TTTCATCGCT ATGCGCGAGC GGCTTGCGTC GAAATTCGGG
TCAGCGCCGC GCCTCTATTA TCGCCACCGC CCCAAGAATA CAGCCCGCAA GGCCGGTAAT
ATCGAGGATT TCGTCACCCG GTGGGGTGGC CTATACGCGC ATATGGTGGT GCTTGACGCC
GATAGCCTGA TGACCGGCCA TGCCATCGTC ACACTTGCCG CGACAATGGA GGCCGACCCC
GATTCCGGGA TCATCCAGAC ACTGCCCTTG ATCGTCAATC GCAACACGTT GTTCGCCCGT
TTGCAGCAAT TTGCCGCGCG TATCTATGGT CCCGTCATCG CTGCCGGCGT CGCCGCCTGG
ATGGGGCGCG ACGGCAATTA TTGGGGCCAT AATGCGATCA TTCGCATCAA AGCCTTTGCC
GGTCATTGCG GCCTGCCGAC ACTCAAGGGC CGCCCGCCTT TCGGCGGCTT GATCCTCAGC
CATGATTTCG TCGAGGCGGC CCTGATCCGC AGGGCCGGCT ACAGCGTCTA TATGCTGCCT
ACCCTCGACG GCAGCTACGA AGAATCCCCG CCGTCCCTCA TCGATCTTTC GGCCCGCGAC
CGGCGCTGGT GCCAGGGCAA TCTGCAACAT CTGCGGGTGA TCGGTTCGGC CGGCTTCCAT
CTCGCCTCGC GTCAGCATTT CGCAACCGGC ATCATGGCTT ATGTCGCCTC GCCTTTATGG
ATGGCGCAAT TGATCATCGG TATCATTCTG GTGATTCAAG CGAGTTACAT TCGGCCGGAA
TATTTCACCA ACCAGTTCAC TCTCTTTCCA ACATGGCCTG TGTTTGATGC GAAACGGTCG
CTCGAACTCT TCACATTGAC CATGGCGATC CTGCTCGCCC CGAAATTTCT CGGCCTGATC
CTCGCCTTGA CACAAGGCAA AACCCGGCGT GGCAGCGGTG GCGCCCTGCC TCTCCTGATC
TCCACGTTCT TCGAGATCAT CTTCTCGGCT TTGCTCGCAC CGATCATGAT GCTGATCCAG
ACCGGCCATG TCATGCATTT CGCGTTCGGC TTTGATACAG GCTGGGATCC GCAGAGACGC
GACGATGGCT CGATCCCCTT CAAGGCAATC GTGCGCCGGC ATCGGTCCCA TGTCGTCATG
GGCGTGGTAA CGCTGATCGC AGGCTATATG ATCTCCCCTT CACTCATCGC CTGGATGTCA
CCGACCATTG TCGGTCTGTT ATTGGCGATT GTCCTGTCAT GGAGCACGGG CCTGCTCGGT
CTTGGTCTTG CTCTCCGCCG TGTGGGTCTT CTCCTCACGC CTGAAGAACA TGACAAGCCT
AAGGTCGTCG AACGCGGCAA TGTGCTTGGC GAAGAGCTTG CGGCGGCTTC AGGGCACGTT
TCCAATGCCT TGACGGTGGT CCATAACGAT GCGCGATTCC GTGCTTTCCA TTCAGCCTTC
CTCTCGTTGG GACCGAAACG CCCCCGAGGG CAGATCACGC CCGAATGGGC GCTCGCCCAA
GCCAAACTTG GAGAAGCGGC TTCTCTTGAA GAGGCAGTGA AATGGCTGCA GCCGAAGGAG
CGTCTGGCGG CGGTGCAGGA TCCGACACTC ATTGCCCGTG TGGCCGAATT ACCGAAAAAG
ACATAG
 
Protein sequence
MDALNTFTDH AHLADQTLWQ DLEAAEGKEA ILSSPFAATP PEMPLTMPAQ NLGRYDRSQR 
RSWQKNRPQF WVWLARLIVF GGGLALTAYG AWQMYQVVSL GGVTPLEWVL LVLFVANFSW
IALAFTGSIV GFIWLFVRPP ADTPTPKTLR EKTVVVMPIY NEAPARVFGA MQAIFEDVEA
TGLGQAFDWF FLSDTTDPDI FIAEEQAFIA MRERLASKFG SAPRLYYRHR PKNTARKAGN
IEDFVTRWGG LYAHMVVLDA DSLMTGHAIV TLAATMEADP DSGIIQTLPL IVNRNTLFAR
LQQFAARIYG PVIAAGVAAW MGRDGNYWGH NAIIRIKAFA GHCGLPTLKG RPPFGGLILS
HDFVEAALIR RAGYSVYMLP TLDGSYEESP PSLIDLSARD RRWCQGNLQH LRVIGSAGFH
LASRQHFATG IMAYVASPLW MAQLIIGIIL VIQASYIRPE YFTNQFTLFP TWPVFDAKRS
LELFTLTMAI LLAPKFLGLI LALTQGKTRR GSGGALPLLI STFFEIIFSA LLAPIMMLIQ
TGHVMHFAFG FDTGWDPQRR DDGSIPFKAI VRRHRSHVVM GVVTLIAGYM ISPSLIAWMS
PTIVGLLLAI VLSWSTGLLG LGLALRRVGL LLTPEEHDKP KVVERGNVLG EELAAASGHV
SNALTVVHND ARFRAFHSAF LSLGPKRPRG QITPEWALAQ AKLGEAASLE EAVKWLQPKE
RLAAVQDPTL IARVAELPKK T