Gene Msil_2386 details

Gene Information

Locus tag: Msil_2386
Symbol:
ID: 7093938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2600754
End bp: 2601899
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 67%
IMG OID: 643465708
Product: protein of unknown function DUF201
Protein accession: YP_002362678
Protein GI: 217978531
COG category: [R] General function prediction only
COG ID: [COG2232] Predicted ATP-dependent carboligase related to biotin carboxylase
TIGRFAM ID:
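
The coordinates and lengths in this record are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2600754, 2601899)
print(length)                     # 1146
print(protein_length_aa(length))  # 381
```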


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCGGCC TGAAAACGCA CGCAGGCGCC GCGGTTCTGA TCGCGGCGTC CTCCGGCCGA 
GCTCTCGCCG CCGCGGCGCG GCGCGCGGGC TTTCGTCCCC TCGTCGCCGA TCTTTTTGAC
GATTGCGACA CGCACAGCCT CTGCGCGGCA AGCCTGATTG CGGGGGATTG GCGCGCGGGG
TTTTCCCGCG ATCCTCTGAT CGCCGCACTG GAAACACTGG CGAAAGCGGC CTCGCCGATC
GGCCTTGTCT ACGGCGCGGG ATTCGAGGAT CGGCCACTCC TTTTGGAGGA GATCGCCGGG
CGCTGGCCAG TTTTCGGCAA TCCGCCCGAG CGGCTGCGAC GCGCCAAGGA CCCGATGGCG
CTCGCCGCGC TTTGCCACGC GCTTGGCGTT CCCCATCCGG AGATCCGCCT CGGCTTGCCG
AACCCCTGCG GCGGCTGGCT CGTCAAAAGC GTCGGCGGCG CCGGCGGCTC CCATGTCGCC
CCCGCGGGCT CCGCGCGACC TGAAAACGAA AGCATTTATT TTCAACGGCT TGCCCCTGGG
CAGCCGATCT CCGTCCAATG CCTTTGCGAC GGAAGCCGGG CCATCTCGCT CGGCCTCAGC
CGGCAATGGA CGTCGCCCGC GCCGGACGAA CCCTTCCGCT ATGGCGGCTG CGTGCGCCCC
GCCGGACTTT CCTCCGATCT CGAAACGCGC CTCGCTGACG CCGCCTGCGC GATCGTCGGC
GCGCAAGGGC TCGTCGGGCT GAACAGCGTC GACTTTCTTG TCGACGAGAA CGACTTTTAT
CTGATCGAGG TCAATCCGCG GCCGGGCGCG GCGCTCGATA TTTTCGAAGA CCGCGAAGGC
CGTCTGTTTC AGGCGCACAT CGACGCATGT CTGGGGCGGC TTCCGGTCCG GCCGCTCGAA
TTCGAAGCAG CGACGGCCGC TGCAATCGCC TATGCGCAAA GAGATATTGC GGCGATGCCT
GAGCTCGACT GGCCGGATTG GACGGCCGAC CGGCAAAAGC CGCAAAGCGC CGTGGGGTTG
TATGATCCGC TCTGCACCAT TAAAGCCTGC GCAGCGCAGA CGTCCGCCGC GCGCGCCCTG
GTTGAGGCGC GCGCCGGCGC TTTGTTCGAC GCCATAAATT GTAAACTGGG GGGAGAAGCA
TCGTGA
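
The GC content listed under Gene Information can be recomputed from the gene sequence. The sketch below is one simple way to do it. It assumes the sequence is pasted in as a string and that the spaces and line breaks of the 10-base blocks should be ignored. The helper name `gc_fraction` is hypothetical.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First block of the gene sequence above: 7 of its 10 bases are G or C.
print(gc_fraction("TTGTCCGGCC"))  # 0.7
```

Applied to the full gene sequence and rounded to a whole percentage, the result can be compared with the GC content field of the record.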
 
Protein sequence
MSGLKTHAGA AVLIAASSGR ALAAAARRAG FRPLVADLFD DCDTHSLCAA SLIAGDWRAG 
FSRDPLIAAL ETLAKAASPI GLVYGAGFED RPLLLEEIAG RWPVFGNPPE RLRRAKDPMA
LAALCHALGV PHPEIRLGLP NPCGGWLVKS VGGAGGSHVA PAGSARPENE SIYFQRLAPG
QPISVQCLCD GSRAISLGLS RQWTSPAPDE PFRYGGCVRP AGLSSDLETR LADAACAIVG
AQGLVGLNSV DFLVDENDFY LIEVNPRPGA ALDIFEDREG RLFQAHIDAC LGRLPVRPLE
FEAATAAAIA YAQRDIAAMP ELDWPDWTAD RQKPQSAVGL YDPLCTIKAC AAQTSAARAL
VEARAGALFD AINCKLGGEA S
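
The protein sequence starts with M even though the gene sequence starts with TTG, which normally encodes leucine. This is consistent with translation table 11, where TTG is one of the alternative start codons and the initiating codon is read as methionine. The sketch below illustrates this with a hand-built codon table. The function name and the handling of the first codon are assumptions made for illustration, not the database's own procedure.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGTCCGGCC TGAAAACGCA CGCAGGCGCC"))  # MSGLKTHAGA
```

The output matches the first ten residues of the protein sequence. Passing the whole gene sequence to the same function allows the full translation to be compared against the protein record.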