Gene Bind_2108 details

Gene Information

Locus tag: Bind_2108
Symbol:
ID: 6198497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 2408315
End bp: 2409649
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 64%
IMG OID: 641706094
Product: glycosyl transferase group 1
Protein accession: YP_001833217
Protein GI: 182679071
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
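
The start, end, gene length and protein length fields listed above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2408315, 2409649)
print(length)                     # 1335
print(protein_length_aa(length))  # 444
```

Under these assumptions the computed values agree with the Gene Length (1335 bp) and Protein Length (444 aa) fields of this record.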


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000441708
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCAGC CCCTGTTCCC TCCCCTGCTC GATCGCGCAA GGCATCCGCT GAACCGCCGA 
ACCGTGCTGC AAATCGTCCC CAGCCTCGTG GGCGGCGACA CAGAGCATAT CGCCCTCGAA
ATCGCGGCCG CGATTACCGA GGCAGGTGGC AAGGCCCTCG TTGCCAGCGA GGGGGGACGG
CTGGTCAGTT CGTTGCAGGC CATGGGCGGG CTCTGGATCC CTTTCCCTGC CGCCTCGCGC
GATCCGCTCG CCATGCTGCT CAACATACGT AAGCTCGCCC GCCTGATCGA AATGGAAAAA
GTGGATCTCG TCCACGCCCA TTCGCGGGCC TCCGCCTGGG TTGCCTATGG CGCGACACGG
CGGACCAAAA CGCCTTTCGT CACCACCTTC CACAGTGGCT ATACCGAAGG CATCGGCTTA
CAGCAGCGGT ATAATTCCGT CATGGCGCGG GGTGATTGCA TTATCGCCCA TTCCCATTAT
AGCGCGCGGC GGATCGAGGC CCGCTATCCA CAGGCTCGGA CCCGGATCAA GGTCATCCCC
CAAGGTATCG ATTTCCAGGT TTTCGCCCCT CAGAGTGTCG ATCCCGCCCG GGTTCAGGCT
CTGCGACAGG CCTGGGGCAT TACACCGGAT CAACGAGCGG TGCTTTTGAC CGACCCTTTT
TCCGATCTCT CAAGGGAGGA TGGCGGCGGG ACGGATGAGC CGCCCTTCAT TGAGGCAGCC
CGCCTCCTTC GCGCCCAAAA CCTCGAAGGG GTCGTTTTCA TCCTGACTGG CGATGCCACG
CCGCCTTTGG GGCGCGGCCA GGAGAGGGGT CGCGAACAAG ATCAAGACAA AATTCAGGAG
CATGGCGGCA GCAGCGCGCG GATCAAGATC CTGACCAATA GCCAAGCCGC CAAGGATTTC
GATCGCAAGA TCAGCGCTGC CGGCCTCTCC GGCATCATGC GACGCGCCAA GAGGGAGGCC
GATCTTCCAG CCGCCTTGCT TGCGAGCGCC GCTGTCGTCG CGCCTATCGC ACGGCCCGGA
GGATTCGGTC CCCTCGCCAT TGCAGCCCAA GCGATGGGCA CTCCGATGAT CCTCCCGGCT
TTGGGTGCCG CGCCAGAAAT CATTCTGGCG CCGCCGCAGG TCGATCAAGC GGAGCGCACC
GGTTGGCTGG TGCCGCCAGG CAATGCCGCG GCGCTGGCCG CGACGCTCGG CGAGGTCCTT
TCGCTCGGCG CCACGGCACG CGGCCTGCTT GGAGAACGTG GCCGGCGTCA TGTCGAGGCC
CATTTCTCAC TCGATGAAAT GTGTCGGGCA ACGCTTGATG CCTATGGCTC TCTGTTCCCG
AGCGGGGAAG AATAA
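
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following sketch shows one way to remove the layout and compute GC content; the helper names are hypothetical, and the example runs only on the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for the 10-base block layout."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_blocks = "ATGCGCCAGC CCCTGTTCCC"
seq = clean_sequence(first_blocks)
print(len(seq))                  # 20
print(f"{gc_content(seq):.0%}")  # 70%
```

Passing the complete gene sequence through the same two functions should give a value comparable to the GC content field of this record.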
 
Protein sequence
MRQPLFPPLL DRARHPLNRR TVLQIVPSLV GGDTEHIALE IAAAITEAGG KALVASEGGR 
LVSSLQAMGG LWIPFPAASR DPLAMLLNIR KLARLIEMEK VDLVHAHSRA SAWVAYGATR
RTKTPFVTTF HSGYTEGIGL QQRYNSVMAR GDCIIAHSHY SARRIEARYP QARTRIKVIP
QGIDFQVFAP QSVDPARVQA LRQAWGITPD QRAVLLTDPF SDLSREDGGG TDEPPFIEAA
RLLRAQNLEG VVFILTGDAT PPLGRGQERG REQDQDKIQE HGGSSARIKI LTNSQAAKDF
DRKISAAGLS GIMRRAKREA DLPAALLASA AVVAPIARPG GFGPLAIAAQ AMGTPMILPA
LGAAPEIILA PPQVDQAERT GWLVPPGNAA ALAATLGEVL SLGATARGLL GERGRRHVEA
HFSLDEMCRA TLDAYGSLFP SGEE
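
The protein sequence above can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hedged illustration: it builds the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs mainly in which codons may act as start codons, and this gene starts with ATG). The `translate` helper is hypothetical, and the example uses only the first three and the last two blocks of the gene.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCGCCAGC CCCTGTTCCC TCCCCTGCTC"))  # MRQPLFPPLL
print(translate("AGCGGGGAAG AATAA"))                  # SGEE
```

The two outputs match the first ten and the last four residues of the protein sequence listed in this record.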