Gene Msil_3131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3131 
Symbol 
ID7093791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3439711 
End bp3442089 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content60% 
IMG OID643466441 
Productprotein of unknown function DUF1549 
Protein accessionYP_002363402 
Protein GI217979255 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAG TGACTTTGCA GAAGGCGTCC CTGGCGCTGG TGATCGGCCT CTCGGCCGCA 
GCCGCGGCGG CGGATGACAA GCCGATCCAG AACGAGCAAC GGGCGGCGGG CGAAGCGGCG
CCGGCGCCGC CGGCCGCCAG TCCGGAGACA GTGGCGGCGA GCGCTCCAGC GCCGGTCTCC
CGTTTGCCCT ATGGTTGGCG GCCGATTCAG CCTCCGGCGA TTCCAGCCGT GAAGGACGCC
GGATGGGCGC GTACGCCGAT TGACGCGTTC ATTCTCGCGA AGGTCGAAGA CAAGGGCCTG
AAGCCTTCGC CCGACGCCGA TCGCGCCGCC TTCATACGCC GCGCGACGCT CGATGTTTGG
GGACTGATCC CCACGCCCGA ACAGGTTGCG GCGTTTGTGA ACGATCCCGC CCCTGACGCT
TATGAGAAAT TGGCGGACAG GCTGCTCGCC TCGCCGCACT ACGGCGAGCG GCAGGCGCGC
TTCTGGCTCG ATCTTGCGCG CTACGCCGAC AGCGCCGGCT TTCAGACAGA CCAGACCCGT
TCGGAGATGT GGCGCTATCG CGATTACGTG ATCAACGCGT TCAATCAGGA CAAGCCCTAT
TCGCGCTTCG TTCAGGAGCA ACTGGCGGGA GACGAATTGG CGCCCGGCGA TCAGGACGTC
CTCGTCGCCG CAGGTTTCCT GACGGGTTAT CCAGACAATT ACAATTCACG CGATCTGGTT
CAAAGAAAGT ATCAGATCAC CACCGACATC ACCGACACGG TCGGGCAAGC CATTCTTGGA
ACGACAGTCG GCTGCGCGCG CTGCCACGAC CACAAGACCG ATAAGTTCTC GCAGAAGGAC
TATTATTCTC TCCAGGCGTT CTTCGCCAAT ACGAATGAAG TCAACAAGGT TCCGGCCGCG
AAGGGGCGAA TCGAGCAAGA GTTTCAGGCC CAGCAGGCCA AATGGGACGA GGCGACCAAG
GACATCCGGG CGCAGCAGGC GGCGCTGCTT GATCCCGTCA AGGACAAAGC CTGGAAATAT
CATAAGGAGC GTTACCTTAC CGACAGCCGC GATGCGATCT TCAAGCCGGA GGGCGAGTGG
ACGCCGCTGG ATCGCTGGAT CAATCATCGC CTCGCCAATG TCACGACCGA AGCCGACTAC
GCAGCCTATC TGCGTGACGC GGCGGAGAAC AAGGACAACC CCGATCACAA CCAGGAAGCC
GAAGAGCGCT GGCAAAAATA CAAGAAGCTT TCGGAGGATT TGAAAAAATT CCAGAAACTC
AAGCCGGCGA AGGGCTCTGA CACCTACACG ACTGTGTCCG AGCTCGGACA CGCTGATGCG
CCGCCCACCT TCATTTTGTT CGGCGGCAAT CATGAGCGAC CGGTGTCCGA GGTTCAGCCG
GCCTTTCCAG CGGCGCTGAC TGACCAAAAT CCGACGATCG TTCCGACGAC AGCCTCCTCG
GGGCGCCGAA CGGCGCTGGC CAACTGGCTC GTCAGCCCGC AAAATCCGCT GACCGCGCGC
GTTTTCGTCA ATCGCGTCTG GAACGAATAT TTTGGCAAGG GCATCGTCGT CACGCTGAGC
GATTTCGGCA AGGCGGGGGA AAAACCGACC AATCCTGAGC TGCTCGATTT CCTCGCGAAT
AATTTCGTCA ATGCGCAGGG CTGGAGCGTC AAGAAGCTGC ATCGGCAGAT TCTGCTTTCG
AGCGTCTACC GGCAGTCGTC CGCCTATCGC GAAGATGCGC ATGCGGTCGA TCCTGATAAC
AAGCTTCTTG CGGTATTTCC GCGCAAAAGG CTTGAGGCGG AAGTCATTCG CGATTCGATC
CTTGTCGCTG CGGGCAAGCT TGACGAGACG GTGGGCGGTC CCAGCGTATT CCCGCCAGCG
CCGGCGAACC TCAACGCTGG AAACCTCTGG GAGGTCTCGA AAGATCCGCG TGATTTCAAC
AGACGCAGCC TCTACATCTT CACGCGCCGC AGCGTTCCCT ACCCCCTCCT GTCCTCATTC
GACATGGCCT CGTCCCAGCA GGTCCACAGC AAGCGCGACG TGACCACGAC GCCGCTGCAG
GCGCTCACGC TGTTCAACAG CGACATTGTG TTTTCATGGT CGCAAATGCT GGCGGGCCGG
GTGATGCGGG AGGCCGGAAC AGATCCCCTT GCCGGCATAG ATCGCCTCTA TCAAATCCTG
TTCGGGCGCA ATCCGGACGA GGCCGAGAAG GCGACCCTCC TCGCGTTTCT GATCAGTCAC
GAAAAGGTTA TCAGGGCCAA GGCGGAAGAC GGGAAGTTCT CGGTCGCGAT TCCGATCGGC
CTGACCGACA CGCAGACGCT TGATCCTATA CGCGCGGCGA CTTTCGTCGA CCTCGTCCAC
GTCGTCGTCA ATTCCAACGA TTTCATCTAC CGGTTCTGA
 
Protein sequence
MKKVTLQKAS LALVIGLSAA AAAADDKPIQ NEQRAAGEAA PAPPAASPET VAASAPAPVS 
RLPYGWRPIQ PPAIPAVKDA GWARTPIDAF ILAKVEDKGL KPSPDADRAA FIRRATLDVW
GLIPTPEQVA AFVNDPAPDA YEKLADRLLA SPHYGERQAR FWLDLARYAD SAGFQTDQTR
SEMWRYRDYV INAFNQDKPY SRFVQEQLAG DELAPGDQDV LVAAGFLTGY PDNYNSRDLV
QRKYQITTDI TDTVGQAILG TTVGCARCHD HKTDKFSQKD YYSLQAFFAN TNEVNKVPAA
KGRIEQEFQA QQAKWDEATK DIRAQQAALL DPVKDKAWKY HKERYLTDSR DAIFKPEGEW
TPLDRWINHR LANVTTEADY AAYLRDAAEN KDNPDHNQEA EERWQKYKKL SEDLKKFQKL
KPAKGSDTYT TVSELGHADA PPTFILFGGN HERPVSEVQP AFPAALTDQN PTIVPTTASS
GRRTALANWL VSPQNPLTAR VFVNRVWNEY FGKGIVVTLS DFGKAGEKPT NPELLDFLAN
NFVNAQGWSV KKLHRQILLS SVYRQSSAYR EDAHAVDPDN KLLAVFPRKR LEAEVIRDSI
LVAAGKLDET VGGPSVFPPA PANLNAGNLW EVSKDPRDFN RRSLYIFTRR SVPYPLLSSF
DMASSQQVHS KRDVTTTPLQ ALTLFNSDIV FSWSQMLAGR VMREAGTDPL AGIDRLYQIL
FGRNPDEAEK ATLLAFLISH EKVIRAKAED GKFSVAIPIG LTDTQTLDPI RAATFVDLVH
VVVNSNDFIY RF