Gene Bind_0658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_0658 
Symbol 
ID6198663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp740293 
End bp741768 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content59% 
IMG OID641704654 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_001831797 
Protein GI182677651 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.724967 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGTCC ACAATCCCCA TACTGGCCTT TCCCTTTCCT CGCCAATAGC GGCTTTCGCC 
CGCTGGACCT CTTCCTTTCG CCACAGCCGG TCCTCTCGGC ACGGCATTCG CCGCACCACC
TGGCAAATCG GTGCAGCGCT CTCCGATTTG CTTCTTCTGC TGGTTCTGCT CGCCCTGATG
GCGGGACTTG AGAGTCTTGC GGGAATCTTC GTAACCAAAA TACGATGGGA ACAGGGTCTG
CTCCCTTTCG GCCTGCTCAC CATCCTCCTT TTCATCGGCA TGGCGGCGGC GCGCGGCGAC
TATGCCCAAG CGCGGGGACT TTCGGCTCAA TCCTCGGATA GCGCGGTCCT GAAACTCTGG
GGCATGGCTT TTTGCCTCGC CGCCACCCTT GGTTTTTTCC TCGCCCTTCT CGAAGGACTT
TCCCGGCTGC AAGCCTTGAG CTTTCTGCTC TTCGGTGGCG TGGTCCTGGC TCTAAACCAT
CGTCTGGTGC GCCATTGCCT GCGTGCCGAG GCGACCAAGG GCCGGCTCAG CCTCAGGCGC
ATCTTTCTGG TCGGCTACGA AAACGAAATC GCCGCCTTTC AGGATCATCA GGACGCCGAT
GTCACGGGCA TGCAGATCGT GACAGCATCC GTCCTGCGGG GCCGCGAGAG CTTGGAGGAT
GATCTGAAAC TCGCCGCCGC CACCGCGCGC CTGCTGCGGC CGGATGATAT TTTCATTCTG
GTGCCTTGGG GGGACAGCGA AACGATCGAC CCTTGCGTTT CCGCCTTCCG CCGCGTGCCT
GCAGCTTTGC ATCTCGGCTC GGAAAACGCC TTGCGACGGT ACAGCGATGC CCGCGTCGTC
AAAATGGGAT CGCTCGCGGG GCTCACCATC GAGCAGCCTT GGTCCGGGGC CAAGGTCGTC
GCCAAACGTT CGTTCGATAG TCTCATGGCC ACATTGGCCC TTCTGCTCCT TGCTCCCCTG
TTCGCAGTGG TCGCCATCGG CATCAAGCTC GACAGCGCAG GGCCGGTCCT GTTTCGTCAG
CGCCGTTATG GGTTCAATCA GGAGCCTTTC GCGATCTTCA AGTTCCGCAC GATGAATGTG
CGCGAAGATG GGCGTCATGT CGAACAAGCA AAAGCCGCCG ATCCACGCGT GACGAAACTG
GGCCGCTTTC TGCGTCGATG GAACATCGAC GAATTGCCTC AGCTTTTGAA TGTTTTGCTG
GGAGATATGT CACTCGTCGG CCCACGCCCG CATGCCATGG CGCATGATCA GATGTTCGAG
CGCGAAGTGA CGCTTTATGC GCGGCGCCAC AATGTTCGTC CGGGTATTAC TGGCTGGGCA
CAGATCAATG GATTGCGCGG CAAAGTCGAT CAGGAATCCC TGCGCCGGCG GATCGAACAC
GATCTGTTCT ATATCGATCA TTGGACGATC TGGCTCGATA TCAAAATTCT CTGGTGCACA
ATCATGTCGC GCAAGGCTTA TGAAAATGCG CGTTGA
 
Protein sequence
MIVHNPHTGL SLSSPIAAFA RWTSSFRHSR SSRHGIRRTT WQIGAALSDL LLLLVLLALM 
AGLESLAGIF VTKIRWEQGL LPFGLLTILL FIGMAAARGD YAQARGLSAQ SSDSAVLKLW
GMAFCLAATL GFFLALLEGL SRLQALSFLL FGGVVLALNH RLVRHCLRAE ATKGRLSLRR
IFLVGYENEI AAFQDHQDAD VTGMQIVTAS VLRGRESLED DLKLAAATAR LLRPDDIFIL
VPWGDSETID PCVSAFRRVP AALHLGSENA LRRYSDARVV KMGSLAGLTI EQPWSGAKVV
AKRSFDSLMA TLALLLLAPL FAVVAIGIKL DSAGPVLFRQ RRYGFNQEPF AIFKFRTMNV
REDGRHVEQA KAADPRVTKL GRFLRRWNID ELPQLLNVLL GDMSLVGPRP HAMAHDQMFE
REVTLYARRH NVRPGITGWA QINGLRGKVD QESLRRRIEH DLFYIDHWTI WLDIKILWCT
IMSRKAYENA R