Gene Msil_3239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3239 
Symbol 
ID7090654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3553351 
End bp3554349 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content66% 
IMG OID643466547 
Producthopanoid-associated sugar epimerase 
Protein accessionYP_002363508 
Protein GI217979361 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR03466] hopanoid-associated sugar epimerase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.109011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGCGG ACACGGTTCT GGTCACAGGA GCTTCGGGCT TCGTCGGATC GGCGGTCGCG 
CGCGCGTTGA CGCATTCGGG CTATAGCGTC AGGGCGCTGC TGCGGCCGAC GGCGACGCGT
GAGAATCTGT ACGGGCTCGA TGCCGAAATC GTCGAAGGCG ACATGTGCGA AATGCGCTCG
GTCGAAAAGG CGATGGCCGG CGCGCGCTTT CTTTTTCATG TCGCGGCCGA CTATCGTCTC
TGGGCGCGCG ATCCCGGCGA AATCGTGCGC ACCAACCGCG ACGGCACGCG CGTTCTGATG
CAGGCGGCTC TGCGCGAAGG CGTCGAACGG ATCGTCTATA CGAGCAGCGT GGCGACGATC
GCCTGCCGGG ACAATGGCGC GCCCGCGGAT GAATCCAGCT CGCTCGCCGA ATGCAACGCC
GTCGGCGCCT ATAAGCGCAG CAAGGTGCTG GCGGAGCAGA TCGTCAAAGA CATGATCGTG
CGGGATCAAC TGCCGGCGAT CATCGTCCAT CCCTCGACGC CGGTCGGCCC CCGCGACGTC
AGGCCGACGC CGACCGGGCG CATCATTCTC GAGGCGGCGA TGGGCCGCAT GCCGGGCTAT
GTCGACACCG GCCTCAATCT CGTCCATGTC GACGACGTGG CTTCGGGTCA TGTCGCAGCG
CTGCGCCGCG GCAAGATCGG CGAACGCTAT ATTCTGGGCG GGCAGGACGT GCCGCTCGCC
GGCATGTTGA GGGATATTGC CGAGCTTTGC GGGCGCCATC CGCCGTGGCT GCGGCTGCCG
CGCGCGCTCG TCTATCCCTT CGCCCTTGCC GCCGAGGCGG CGGCGCATCT CACCCACAAA
GAACCCTTCG TGACGATCGA CGGTCTGCGC ATGTCGCGCC ACACCATGTT CTTCAGCTCG
GCCAAGGCCG AGCGTTGCCT TGGCTATGTG GCGCGGCCCT ATCGCGAAGC GCTGAACGAC
GCCCTGAACT GGTTCACCGA AAACGGACGG CTGAAATGA
 
Protein sequence
MTADTVLVTG ASGFVGSAVA RALTHSGYSV RALLRPTATR ENLYGLDAEI VEGDMCEMRS 
VEKAMAGARF LFHVAADYRL WARDPGEIVR TNRDGTRVLM QAALREGVER IVYTSSVATI
ACRDNGAPAD ESSSLAECNA VGAYKRSKVL AEQIVKDMIV RDQLPAIIVH PSTPVGPRDV
RPTPTGRIIL EAAMGRMPGY VDTGLNLVHV DDVASGHVAA LRRGKIGERY ILGGQDVPLA
GMLRDIAELC GRHPPWLRLP RALVYPFALA AEAAAHLTHK EPFVTIDGLR MSRHTMFFSS
AKAERCLGYV ARPYREALND ALNWFTENGR LK