Gene Bind_3152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3152 
Symbol 
ID6201845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3596406 
End bp3597428 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content62% 
IMG OID641707100 
Producthopanoid-associated sugar epimerase 
Protein accessionYP_001834202 
Protein GI182680056 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR03466] hopanoid-associated sugar epimerase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAGGCA AGGATGTTCT GGTCACCGGG GGATCAGGTT TCGTCGGGTC GGCGGTGAGT 
CGCGCCCTGA TCGAAGCCGG ATTTAGCGTT CGTGTCCTGA CGCGCGGAAC CAGCCCGCGG
GGCAATCTTT CGGGCCTCGA TGTCGAGATT GTCGAGGGCG ACATGCGCGA TCCGGACGCC
GTGGCCCGCG CCATGGCCGG CGTCCAGTTC CTGTTCCATG TCGCCGCCGA TTATCGGCTC
TGGGCACGCG ATCCGCGGGA AATCCACCTC AACAATGTCG AGGGCACCCG CATTGTCATG
CAAAATGCGC AAAAGGCCAA GGTTGAGCGC GTCATCTATA CGAGTTCCGT GGCGACCTTG
GCTTTCCAGC CCAATGGTTC GGTGACCGAC GAGACAATGC CCCTGTGCGA GGCGCAGGCG
ATCGGCGCCT ATAAGCGCAG CAAGATCGCC GCCGAGCGAA TGGTCACACG GATGATCCGT
GAGGAGGGGC TGCCAGCGAT CATCGTGCAT CCCTCCACCC CGATTGGCCC CCGCGACATC
AAGCCGACGC CGACCGGGCG CATCATCGTC GAGGCGGCAC GCGGCAACAT TCCAGGGTTC
GTGGACACCG GCCTCAATCT GGTTCACGTG GACGATGTCG CAAGCGGCCA TCTCGCCGCC
TTACGGCGCG GCGAAATCGG TGGCCATTAT ATTCTCGGCG GCCAGAATGT CGCTTTTTCC
AATCTGCTTG CGGAAATCGC CCGGCTCGGC GGCCATAAAA CGCCGAAATT TCGCATTCCG
CGTCCCCTGG TCTATCCCTT CGCCTATGCC GCCGAGGCCA GGGCGCGCCT AAATGGACGC
ACGCCCTTCC TGACCCTGGA CGGCTTGCGC ATGTCCAAAC ATCATATGTT CGTCAGTTCG
GCGAAGGCGG AACGTGAACT TGGCTATCAT GCCCGCCCCT ATCAGGACGC CTTGATCGAG
GCCTTCGCCT GGTTTCGCGA CCAGGGCTAC CTCGGGCTTT CCGGAATGGA GAAGTTTTCA
TGA
 
Protein sequence
MKGKDVLVTG GSGFVGSAVS RALIEAGFSV RVLTRGTSPR GNLSGLDVEI VEGDMRDPDA 
VARAMAGVQF LFHVAADYRL WARDPREIHL NNVEGTRIVM QNAQKAKVER VIYTSSVATL
AFQPNGSVTD ETMPLCEAQA IGAYKRSKIA AERMVTRMIR EEGLPAIIVH PSTPIGPRDI
KPTPTGRIIV EAARGNIPGF VDTGLNLVHV DDVASGHLAA LRRGEIGGHY ILGGQNVAFS
NLLAEIARLG GHKTPKFRIP RPLVYPFAYA AEARARLNGR TPFLTLDGLR MSKHHMFVSS
AKAERELGYH ARPYQDALIE AFAWFRDQGY LGLSGMEKFS