Gene Bind_3438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3438 
SymbolmdoD 
ID6201465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3904009 
End bp3905646 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content60% 
IMG OID641707385 
Productglucan biosynthesis protein D 
Protein accessionYP_001834484 
Protein GI182680338 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGAA AATCATTTGA GTCCGAGAGC GTTGGCTTGT GCCGGCACCG CCGCGACGTC 
TTGAAAGCCC TGACGGCTTT CGGTCTCCTG TCCCAAAGTT CGGCTTTTCT TCGCGCTGAT
CCCGCGAAAA ACCCCTCACA ACTGGGATTA CGGCTTGGAC CAGCCGAGCC CTTTTCTTTC
ACCCTCTTGA AGGATCTGGC GCGTCGACGC GCCGCCGCCC CCTATTCGCC ACCCCCGGAA
ATCGATCCCG ATATCATCGC TCAACTCGGT TATGACGCCT GGGGCGAGAT TAGTTTCGAC
ATGGATCACG CCTTATTCGC CGAAGGGCCC GGTCGCTTTC CGGTCAGTTT CTTCCCGCTC
GGCAATTTCT TCCACAAATC GGTCGCGATG CATGTCGTCG CAAATGGCAC GGCGCGAGAG
ATCCTTTATG ATCCCTCTTA CTTCCAAATG CCGGCGGACT CACCGGCGCG GCGCCTGTCG
CCAAACGCCG GCTTTGCCGG TCTGCGCATC CAGGAGGCCC GCGATGGTGC CCTTGATTGG
CGCCATAATG ATTGGGTAGC CTTTCTCGGC GCGTCCTATT TCCGGGCGAT CGGCGCCCTG
CATCAATATG GCCTTTCGGC GCGTGCTGCC GCTCTCGACG TCGCAGTCGC CGGCCATGCC
GAGGAATTTC CCGATTTCAC GGGTTTCTTC ATCGAACAGG ATGAGACCCG CGACGGCCTG
ACGATCTATG CGCTGCTGGA AAGCCCTTCG CTAACGGGCG CCTGCCGCTT TGTCCTGACG
CGCGATAAAG GCGTCACGAT GCATGTGGAC CAGACGCTCT CTATCCGTAA GCCCGTCACC
CGTTTCGGTC TCGCGCCCCT GACTTCGATG TTCTGGTTCT CCGAAACCGT CAAGCCGACC
GCCGTCGATT GGCGACCGGA AGTGCATGAT TCCGATGGGC TTGCGATCTT CACGGGCAAT
GGCGAGCACC TCTGGCGGCC GCTCAACAAT CCGCCGCGCA CCATGGTCTC CTCGTTTATC
GATCAGCATC CGCGCGGCTT CGGCCTGTTG CAGCGGGATC GCATTTTCGA TCACTACTTG
GACGGGGTGC GCTATGATCT CCGTCCGAGC CTCTGGGTCG AGCCCCTGGG CGAGTGGGGC
AAAGGCGCGG TCCAGCTCGT CGAAATTCCG ACCAACGACG AAATCCACGA CAATATCGTC
GTCATGTGGG TGCCGGAGCA GCCCATGACG GCCGGAACCG AATTAAACCT CGCCTATAAA
CTCTATTGGC AGGCCGATGA GCCCTTTCCG AGCCCGCTTG CACGCTGCAT CGCGACACGG
CTCGGCAATG GCGGGCAGCC TGGCCAACCA CGGCCGAAAA GCATCCGCAA ATTCATGGTC
GAATTTTTGG GCGGCCCCCT CAAGGACCTC CTTCCCGGTG AAAAGCCAGA GGCCGTTCTC
TGGGCCTCGC GCGGCGGCTT TTCCTATATT TTTACCGAAG CCGTGCCCGA TGACGTGCCG
GGCCATTGGC GGGCGCAATT CGACTTCACC GACAGCGCCC CGGCCGACAC AAACGATCCC
GTCGAAATGC GCCTCTATCT CAAAACCGGC AACAAAGTGC TGAGCGAGAC CTGGGCCTTT
CAATATCACC CGTTCTGA
 
Protein sequence
MSGKSFESES VGLCRHRRDV LKALTAFGLL SQSSAFLRAD PAKNPSQLGL RLGPAEPFSF 
TLLKDLARRR AAAPYSPPPE IDPDIIAQLG YDAWGEISFD MDHALFAEGP GRFPVSFFPL
GNFFHKSVAM HVVANGTARE ILYDPSYFQM PADSPARRLS PNAGFAGLRI QEARDGALDW
RHNDWVAFLG ASYFRAIGAL HQYGLSARAA ALDVAVAGHA EEFPDFTGFF IEQDETRDGL
TIYALLESPS LTGACRFVLT RDKGVTMHVD QTLSIRKPVT RFGLAPLTSM FWFSETVKPT
AVDWRPEVHD SDGLAIFTGN GEHLWRPLNN PPRTMVSSFI DQHPRGFGLL QRDRIFDHYL
DGVRYDLRPS LWVEPLGEWG KGAVQLVEIP TNDEIHDNIV VMWVPEQPMT AGTELNLAYK
LYWQADEPFP SPLARCIATR LGNGGQPGQP RPKSIRKFMV EFLGGPLKDL LPGEKPEAVL
WASRGGFSYI FTEAVPDDVP GHWRAQFDFT DSAPADTNDP VEMRLYLKTG NKVLSETWAF
QYHPF