Gene Msil_1558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1558 
Symbol 
ID7092065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1682257 
End bp1683807 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content69% 
IMG OID643464885 
ProductHemY domain protein 
Protein accessionYP_002361870 
Protein GI217977723 
COG category[S] Function unknown 
COG ID[COG3898] Uncharacterized membrane-bound protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCG TCCTCCTGTT TCTCGCCGTG CTGATTGGTC TTGCCATCGC CGAAGCGTGG 
CTGATCGAGC GGCCGGGCGA ACTGGTGCTG AATTGGCAGG GCTATAAGAT CGAAACCAGC
GTGCTGATGG GCATCGCCGC TGTCTTCGTC GCGGCGGCGA TTCTGCTCGG CCTGTGGATG
CTGCTGCGCT TCATCTTCAA CATTCCCTCG CTGATGGCTC TGGCGAGCCG GGGGCGGCGG
CGCGAGAAAG GCTATGCGGC CCTCTCCCGC GGCATGATCG CCGTCGGCGC CGGGGATACG
CAGGCGGCGC GCCGGGCGGC GGTCGAGGCG CAAAGAATGC TTCCGAACGA TCCTCTGGCG
CTGCTCCTCA AGGCGCAGTC GGCGCAGCTC TCCGGCGAAA GCGGCTCTGC CGAGGCGGCG
TTCACGGAGA TGACCCGGCG CAACGACATG CGCGTCCTTG GCTTGCGTGG ACTGCATGTC
GAGGCCGAGC GCCAGGGCGA CGTCGAGAAA GCGCAACACT ACGCCGAGAC GGCGCGGGAG
ATCGCGCCGC TGCCCTGGGC GGCGAAGGCG AATTTCGCGC ATAAGGTCGC CGCCGGCGAT
TGGCGCGGCG CGCTGGCGCT GCTCGAGAGC GGCGCCGGAT CGAAGCATAT CGACAAGACC
ATCCGCGAGC GCAGCCGCGC CGTGCTCGAA ACCGCCATCG CGATCGAAAA GGCCGACGCC
CAGCCGGCGG AAGCGCTTGC GCTGGCGCGG GCCGCGGTCA AGCGCGCGCC GACGCTTGTC
CCGGCGGTCG CGCTCGCGGC GCGCCTGACC AGCGAGCAGG GCGACGCGCG CAAAGCGGCA
AAGCTGATCG AAGCAGGCTG GGCCGGGACG CATCACCCGG ATATCGCCAA GATCTATGTC
GGGCTCTATC CCGGCGAATC GAGCGCCGAC CGGCTGAAGC GGGCGATCTC GCTGGCGCAG
CTCGCCCCGC GCGAGCCGGA AAGCAAGATC ATGGTCGCGG AGGCGGCGAT CGCGGCGGGC
GATTTCAAGG CCGCGCGCGA GGCGATGCAG CCTCTGATCG AGGGACCGGA GCGTCCGACC
GCGCGGATGT GCCGGCTGAT GGCCGAGCTT GAAGAAAAGC AGCACGGGGC GGCCGGATAT
ATCAGGGAAT GGCTGACGCG GGCCTCGCTC GCGCCGCCGG ACCCGACCTG GGTGGCGGAT
GGCGTCGCCT CCGATCAGTG GCGGCCGATC TCGCCGGTGA CGGGAAAGCT CGACGCCTTC
GTCTGGCAGA AGCCGGTGGA GCGGCTGAGC TCGGGCAATG AGGCCGAAGA TGCGATCTTC
GCGCAGATCC TTCCGCCCGA GCCTCCGCTC CTGCTGGAGG AAGCGCAGAC GGCCTCCGGA
GCAAAGACCG GGGCGCTTGA CGGGCCGCCG CACCCAGACC CCGAGATCGC CATCCCGATG
GGCCCGCCGC CGCCCACGAC CGCTGAGCCG TCGGCCGCGC TCACGCCGCA GGAAAAGGAG
CGGGACAAGC CGAAGGGCCT CGCCCAGCTC TTTGACGTCA AGCCGAAATA G
 
Protein sequence
MIRVLLFLAV LIGLAIAEAW LIERPGELVL NWQGYKIETS VLMGIAAVFV AAAILLGLWM 
LLRFIFNIPS LMALASRGRR REKGYAALSR GMIAVGAGDT QAARRAAVEA QRMLPNDPLA
LLLKAQSAQL SGESGSAEAA FTEMTRRNDM RVLGLRGLHV EAERQGDVEK AQHYAETARE
IAPLPWAAKA NFAHKVAAGD WRGALALLES GAGSKHIDKT IRERSRAVLE TAIAIEKADA
QPAEALALAR AAVKRAPTLV PAVALAARLT SEQGDARKAA KLIEAGWAGT HHPDIAKIYV
GLYPGESSAD RLKRAISLAQ LAPREPESKI MVAEAAIAAG DFKAAREAMQ PLIEGPERPT
ARMCRLMAEL EEKQHGAAGY IREWLTRASL APPDPTWVAD GVASDQWRPI SPVTGKLDAF
VWQKPVERLS SGNEAEDAIF AQILPPEPPL LLEEAQTASG AKTGALDGPP HPDPEIAIPM
GPPPPTTAEP SAALTPQEKE RDKPKGLAQL FDVKPK