Gene Bind_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1081 
SymbolmdoG 
ID6199087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1242373 
End bp1243974 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content57% 
IMG OID641705074 
Productglucan biosynthesis protein G 
Protein accessionYP_001832213 
Protein GI182678067 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.418062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.460369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATT TTCCCCGCCG TCATTTCGTT CAAGGCTTGT TTGCGTCCAC GGCATTGCTG 
AGCTGTGCAA CATCAGGCCA CGCCGCGCCC GCCAATCCTA GTGGGAATAG CGCTGCCAAT
AATGCCGCGA CTCCCGCGCC GGCCCCAAAA TTTGATTTTG ACGATGTCGT ACGGCGGGCG
CGTGATCTTG CCGCGGCGCC CTTCGATGCG CGCCCGCCGG TTTTGCCGGA CGCTTTGGCG
CAACTCGATT TCGATACCTG GCGCGACATT AAAACGCGCT CGGACAAGGC TTTTGTGCTC
AGCCCGCAAA GCCCGTTCCG GATGCAATTG TTTCATCTCG GGCATCTTTA TAAGCGGCCG
GTTATCGTCA ATACGATCCG CGACGGGATT CCAACGCCCA TCCCCTATGC GGCTAATCTC
TTTGACTATG GCCGTGCGAA ATTCGACAAA CCTCTGCCTG TCAATCTTGG CTTCGCGGGC
TTCCGGCTGC ATTATCCTCT GAATTCTCCG AAAAGTTCCG ATGAAATCAT TTCCTTCATC
GGCGCGAGCT ATTTCCGATT CGTCGGGCGT GGAGAAAGCT GGGGGCTATC GGCTCGCGGT
CTCGCTGTCG ATAATGGCAG CACTAACGAG GAATTTCCCT TTTTCCGCGA GTTCTGGATC
GAGGCCTCGC AGCAGCAGGG CGACCGCGCG ATCATTTATG CCTTGCTCGA TAGTTCCGCC
GCGACCGGCG CCTATCGGTT CGAGCTCAGC CCCGGCAAAC AGACCGAGCT CGATGTTTCG
GCAACGCTCT TTTCGCGCAA GACAGGTGTC AAACTGGGTC TCGCGCCGCT CACCTCAATG
TTCCTGGCGG GTGAGAACGA ACATCGCTTC AAGGACGATT TCCGTTTTGA ATTGCATGAT
TCCGATGGCC TGCTGATCCA CAGCAATACT GACGAGTGGA TCTGGCGGCC GTTGCGCAAT
CCCCCCACGC CGCAAATAAC AACCTTCCCC GCACGTGATT TCCGTGGATT TGGCCTGTTG
CAGCGCGACC GTGCGTTCGA CCATTACCAG GATCTGGAAC TCGCCTATCA GTCGCGGCCG
AGCTATTGGG TTGAGCCCCA CGAAAATTGG GGCGATGGGC ATGTCGATCT GCTCGAATTG
CCAACCGCGG ACGAGACCAA CGATAATATT GTCGCCTCTT TCGTACCGAG TGAAAGCTTC
GAGCCCGGCA AGGCCCTGTC CTTCGGTTAT CGCATCACGT CTTTTCTCGA CGCGGCCAAA
TTTTCTCCGA ATGGACGTGT AATCAATACG TTCCAGACGG TGGCCAAGGC GTTGGGCTCA
TCCGAACCCG TCGTGCCCAA TTCCCGGCGT TTCCTGATCG ATTTCGCTGG GGGTGATCTC
GCTTATTATC TCGATGATCC CTCTCTCATC GAGATTGTTT CGGCGACGAA CAATGGCCAG
ATCCTGCGGA CATTCCTGAT GCCGGACCCC TATATCAAGG GCTTCCGCGC CGCTATCGAT
GTTGCGCTCC CGCCGGGCCA GACAGCGGAT CTGCGGCTCT TCCTGCGTTC GGGCCAGCGT
GTCTTGACCG AGACCTGGGT CTATTCATGG CTGCCCCAAT GA
 
Protein sequence
MADFPRRHFV QGLFASTALL SCATSGHAAP ANPSGNSAAN NAATPAPAPK FDFDDVVRRA 
RDLAAAPFDA RPPVLPDALA QLDFDTWRDI KTRSDKAFVL SPQSPFRMQL FHLGHLYKRP
VIVNTIRDGI PTPIPYAANL FDYGRAKFDK PLPVNLGFAG FRLHYPLNSP KSSDEIISFI
GASYFRFVGR GESWGLSARG LAVDNGSTNE EFPFFREFWI EASQQQGDRA IIYALLDSSA
ATGAYRFELS PGKQTELDVS ATLFSRKTGV KLGLAPLTSM FLAGENEHRF KDDFRFELHD
SDGLLIHSNT DEWIWRPLRN PPTPQITTFP ARDFRGFGLL QRDRAFDHYQ DLELAYQSRP
SYWVEPHENW GDGHVDLLEL PTADETNDNI VASFVPSESF EPGKALSFGY RITSFLDAAK
FSPNGRVINT FQTVAKALGS SEPVVPNSRR FLIDFAGGDL AYYLDDPSLI EIVSATNNGQ
ILRTFLMPDP YIKGFRAAID VALPPGQTAD LRLFLRSGQR VLTETWVYSW LPQ