Gene Bind_1804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1804 
SymbolmdoD 
ID6200535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2046342 
End bp2047952 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content60% 
IMG OID641705794 
Productglucan biosynthesis protein D 
Protein accessionYP_001832921 
Protein GI182678775 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.6877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.672061 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGAAC GCAGGGATTT CCTGAAATTC ACCCTTGGCG GCGTCGCCAT GGGACAATTG 
GCTCCAGGCC TCGGCCGAGA CGGGAAGAAT ACTTCACCTT TTACTCTCGC CAGCAGCGCC
AACGCCGAGG AAGCGACAAC ACCAGCTCCC AAAAGCCCAT CCTCCGGCTT TACCCTCGGC
GAGGCCAAGC CCTTCGAGCC CGCCAATGTC ACGGAGGCCG CCCGCGCGCT TGCCCGTCAA
GCCTATAAGC CGCTCCGTCA GCCGCTGCCG GATGTTTTCG CTGCGCTCAG CCATGATCAA
TACGGCACCA TCTATGCGAA GCCGGGCACG GCCCTTTGGG CGGGAGACAA TCTCGGCTTC
GCGATCGAGC CTCTGCATCG CGGCTTTATC TTCTCCGACC CCATGGAGAT CAATATTGTC
GAGCATGGCG CCGCACGCCG CCTCGTCTAT GATCAGGCTC AATTCGCCTT TGGCAAACTC
GCCGTGCCGA ACGCGATGGG AGATATTGGT TTTTCCGGGT TTCGGATCTT GGTGCCGCAA
GATGCCCAGA ATTTCGGGGC GCTCGCGACC TTTCAAGGCG CCAGTTTTTT CCATGCGATA
GCCCGCGGGC AGAGCGAGGG CGTGACCGCG CGAGCCCTAT CGATCAAAAC CGCCGATCCG
CGCGGCGAGG AATTTCCGGC GATCCGCGCC ATCTGGATCG AAACCCCGAC CCTCGCCGAA
AATGCCCTGA CGCTCCATGC CCTGATCGAT TCAGAAAGCG TTGCCGGCGC CTATCGCTTC
ACCCTGCATC CCAGTGAAGC GACGATCATC GATACGGAAT GCACGCTTTT TGCCCGTACG
AATGTCGACC GTTTCGGCCT TGCCACCATG ACCGGTGCGC ATCTTCTCGC GCCCGTCGAC
CAACGCCATA TGGACGATCT GCGACCACAA GTCAGCGAAG TCGGCGGCCT TGCGATGCTG
TCCGGCCGAG GCGAATGGCT GTGGCGGCCC GTGGCCAATC GCGAAACCTT GCAGATTTCC
GTATTCACGG ACGAAAAACC GCATGGTTTT GGTTTTTTGC AGCGCGACCG TGCTTTTGAC
GCTTTTCAGG ATGATTTTCA GCATTGGGAA ACACGCCCCT CCCTGTGGAT CGAACCGATC
GGCGAATGGG CTGCCGGCGC CCTGCAATTG ATCGAAATCC CCTCGGATGC CGAGATTAAC
GACAACATCC TTGCCTTTTG GCAGCCGCGT CAGGCCTTGG CACCGGGCAG CGAGACCTCC
TTCGCCTACC GGCAATTCTG GTGCTGGACT CCCCCCGTCA GTCCTCCCCT GGCGATCACC
ACGGCTTCGC GTCAAGGGCA TGGCTCAGCG GCGCGGCGGC GCCGTTTCCT TGTCCAGTTC
ACAGGCCCCG GCCTCGCCGA TCCGCAAAAG ATCAAGGACG TGAAACCAAA TCTGACCGCC
ACGCCGGGCA CGATCCTCGA CATGCGCACA TTCGCGCTGC CCGAACGGCA ATCCTATCGC
ATTGTCTTCG AACTGGATCC CGGCACTGAA ACCACTTCGG AAATGCGCCT CGTGCTTGAG
GTTGCGGGAG TTCCGATCAG CGAGACCTGG CTTTACCGAT GGACGCCCTG A
 
Protein sequence
MIERRDFLKF TLGGVAMGQL APGLGRDGKN TSPFTLASSA NAEEATTPAP KSPSSGFTLG 
EAKPFEPANV TEAARALARQ AYKPLRQPLP DVFAALSHDQ YGTIYAKPGT ALWAGDNLGF
AIEPLHRGFI FSDPMEINIV EHGAARRLVY DQAQFAFGKL AVPNAMGDIG FSGFRILVPQ
DAQNFGALAT FQGASFFHAI ARGQSEGVTA RALSIKTADP RGEEFPAIRA IWIETPTLAE
NALTLHALID SESVAGAYRF TLHPSEATII DTECTLFART NVDRFGLATM TGAHLLAPVD
QRHMDDLRPQ VSEVGGLAML SGRGEWLWRP VANRETLQIS VFTDEKPHGF GFLQRDRAFD
AFQDDFQHWE TRPSLWIEPI GEWAAGALQL IEIPSDAEIN DNILAFWQPR QALAPGSETS
FAYRQFWCWT PPVSPPLAIT TASRQGHGSA ARRRRFLVQF TGPGLADPQK IKDVKPNLTA
TPGTILDMRT FALPERQSYR IVFELDPGTE TTSEMRLVLE VAGVPISETW LYRWTP