Gene Bind_1034 details


Gene Information

Locus tag: Bind_1034
Symbol:
ID: 6199987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 1185432
End bp: 1186904
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 55%
IMG OID: 641705026
Product: undecaprenyl-phosphate glucose phosphotransferase
Protein accession: YP_001832166
Protein GI: 182678020
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
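
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source record.

```python
# Values copied from the Gene Information record above.
START_BP = 1185432
END_BP = 1186904


def gene_length_bp(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1473 490
```

Under these assumptions the results agree with the listed Gene Length (1473 bp) and Protein Length (490 aa).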


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATTTCA GCCGTACTGG TGAGATGGTC GAACAGGGAA ATCATGTCGA GCACCATTCC 
TGGTTCGCTC AGCATATCAA TATATCTTAC CAAAATATCG AGTGGATCGC GGCACTCATT
GATATTATAC TTATAATTTC TGCCAGCATC ATCGGTACTA TCCTTTTTCA ATATATCGCC
TATGGCGATT TTGTCGGCTT GGAAGCCAGT TTCGGTGTCG GCGTCACGAA TAGCCTCCTT
TATGCCTATG TCGCGCGCTC ACATGGTTTA TATCGGCTGC CCGTTCTCCT CTCCCCTCTC
CGGCACCTGG CTCGTATCTT CTTCTACTGG CTTGCTGTCG GCATGTTCGT CGCTGCTGGC
CTCTTTCTGT TGAAAGTTGG CACCGCCATG TCACGCGGTG CCATGGTCTC CACCGGCTTG
TTGCAGATCA GCTTCCTCGT CATTGCGCGA CTGCTGGAAG AAAAACTGAC TCGCTCCATG
GTCGCCTCGG GCAACCTTGC CGGACGGCGC GTCGTCACGA TCGGCGAAAG CGGTGAATTA
CAGCGCTTGA GCGCGTCCTA TCTTTTCCGT TATTTTGGTT TGAAGGAAGT CGCTCGCATC
GCGCTGACGG ATATCGGCGA TGTCAGGGCC AATCCCGCGA CAGAGAATTC CTATTTCCTT
GAGGCCATGG ACGCGGCGCG CGACCTCAAT GCTGAGGAAT TCGTCATAGC CCTGCGCTGG
AGCAGCCGCC CCTTGCTCGA AACTGCACGG GAACAGCTTC GCGCTTCTCC CCTGCCGGTC
ACGCTTTTAC CGGATCACAA TATTCGTTCG ATTCTCGGCC GGCGCGGTAT TGCCACGGGA
CGTCCAGTCG TTTCGCTCGA GCTCCAGCGC GCGCCCCTGA CCTCGCCCGA AAGAGCCGTC
AAACGCATTG TCGATATTGT CTTGGCCTCG ATTGCCTTGG CCCTGCTCTC ACCCATCTTC
TTTATCGCAG CCCTCGCAAT CAAATTGGAC AGCAAGGGGC CGGTGATTTT CAAGCAAAGG
CGCAACGGGT TTAACTCCCG CCTGTTCCTG ATTTATAAGT TCCGCTCCAT GACTGTGATG
GAGGATGGCG CCGTCGTGAA GCAGGCCCAG CGCAACGATC AGCGCGTGAC CCGGGTGGGA
GCTTTCCTGC GGCGTTCCAG CATTGATGAA TTGCCGCAAT TACTGAATGT CCTCAAGGGC
GATATGTCCC TGGTCGGCCC GCGGCCGCAC GCTCTTGCTC ATGACAATGA ATATAAGGCC
TTGATCGCGA AATATGCTTT TCGCCATCAT GTTAAGCCTG GCATGACAGG CTGGGCACAG
GTCAACGGTT TGCGGGGCGA GACCGGCCGT CTCGAGCAAA TGGTGGAACG GGTCAAGCTG
GACCTGTGGT ATATTAATCA CTGGTCGCTG GCGTTCGATA TCAGCATTCT GCTGCGCACC
TGTTTCGAGG TTTTGCGCAA TCGTGCTTAT TGA
 
Protein sequence
MYFSRTGEMV EQGNHVEHHS WFAQHINISY QNIEWIAALI DIILIISASI IGTILFQYIA 
YGDFVGLEAS FGVGVTNSLL YAYVARSHGL YRLPVLLSPL RHLARIFFYW LAVGMFVAAG
LFLLKVGTAM SRGAMVSTGL LQISFLVIAR LLEEKLTRSM VASGNLAGRR VVTIGESGEL
QRLSASYLFR YFGLKEVARI ALTDIGDVRA NPATENSYFL EAMDAARDLN AEEFVIALRW
SSRPLLETAR EQLRASPLPV TLLPDHNIRS ILGRRGIATG RPVVSLELQR APLTSPERAV
KRIVDIVLAS IALALLSPIF FIAALAIKLD SKGPVIFKQR RNGFNSRLFL IYKFRSMTVM
EDGAVVKQAQ RNDQRVTRVG AFLRRSSIDE LPQLLNVLKG DMSLVGPRPH ALAHDNEYKA
LIAKYAFRHH VKPGMTGWAQ VNGLRGETGR LEQMVERVKL DLWYINHWSL AFDISILLRT
CFEVLRNRAY
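
The gene and protein sequences above are related through translation table 11, whose amino acid assignments are the same as those of the standard genetic code. The following is a minimal sketch of that translation together with a GC-content calculation. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the code assumes an unspliced forward-strand reading frame that starts at the first base. It does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

# Standard amino acid assignments (also used by translation table 11),
# listed in TCAG order for the first, second, and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTATTTCA GCCGTACTGG TGAGATGGTC"
print(translate(first_blocks))  # MYFSRTGEMV
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.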