Gene Msil_0785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0785 
Symbol 
ID7092643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp866219 
End bp867175 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content67% 
IMG OID643464122 
Productprotein of unknown function DUF58 
Protein accessionYP_002361117 
Protein GI217976970 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.159111 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTGG TCCGCCTCCT GACGGGCGAC GAAAGCCGTC CGGGCGCAGG GCGCCAGGAA 
GAAGCGCTGA CGCTGTCGCA GCGCTTTCCC GGCCTCGTCG TCTCCGCGCG GGATGTGGCG
GCGAGCGTTT TGCAAGGGGT ACACGGACGC CGCCGCGCCG GCTCCGGCGA GACGTTCTGG
CAGTTTCGCC CTTTTGTCGC GGGCGAATCG CGCGGCCGCA TCGACTGGCG CCGTTCGGCG
CGCGACGATC GGCTCTATGT GCGTGAACGG GAATGGGAGG CCGCCCATAC GGTGATGCTC
TGGATCGACC GTTCCGCTTC GATGCGCTTT GTCTCAAAGC TCGCCTTGCA GGCGAAGATC
GACCGCGCTC TCGTTCTGGG CCTTGCCGCC GCCGATCTTC TCGTCCAGGG CGGCGAGCGC
GTCGGCTTGC TCGGCCTGAC CCGCCCGCTG GCGGCGCGCA ATATCGTCGA AAGGTTCGGC
GAAGTTCTTC TCAACGAATT CCGGCTGCGG GAAAAAAACG GCAAAGCCAG CGAAGCCGAG
GAGCTGCCCC CGCCCGAAGT CCTGCCGCGC AATTCGCAGG CGGTGCTGAT CGGCGATTTC
CTCAGCGCCC CGCAAGATAT TGCGGCGACG ATCGAAGCGC TCGGCGCCCT TGGCGCGCGC
GGCCACCTCG TGATGATCGC GGACCCCGTG GAAGAAACCT TTCCCTTTGC CGGCAACACG
GAATTCATCG ACGTCGATTC GCCGGCGCGG CTCCGCATCG GGCAGGCGGA ATCTTTCCGC
GCCGATTATA TCCGCAGGCT CACCGCCCAT CGCGAGGCGA TCCGCGCGGC GGCGCGGGCG
CGCGGCTGGA CCTTGATGCT GCACAGGACA GATCGCCCGG CGACCGAGGC GCTGCTGGGT
TTGAGAATGC AGCTTGAGGC CAATCTGTTT AACGCCGCCG CCGGCCACGC GCTTTGA
 
Protein sequence
MPLVRLLTGD ESRPGAGRQE EALTLSQRFP GLVVSARDVA ASVLQGVHGR RRAGSGETFW 
QFRPFVAGES RGRIDWRRSA RDDRLYVRER EWEAAHTVML WIDRSASMRF VSKLALQAKI
DRALVLGLAA ADLLVQGGER VGLLGLTRPL AARNIVERFG EVLLNEFRLR EKNGKASEAE
ELPPPEVLPR NSQAVLIGDF LSAPQDIAAT IEALGALGAR GHLVMIADPV EETFPFAGNT
EFIDVDSPAR LRIGQAESFR ADYIRRLTAH REAIRAAARA RGWTLMLHRT DRPATEALLG
LRMQLEANLF NAAAGHAL