Gene Msil_1748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1748 
Symbol 
ID7090860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1903702 
End bp1904952 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content68% 
IMG OID643465071 
Productputative RNA polymerase, sigma-24 subunit, ECF subfamily 
Protein accessionYP_002362056 
Protein GI217977909 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAGC TTACCTGGAT CAATGCCGCG CTCACCGCCG CCCGGCCACA GGCGATCGCC 
GCGCTGATGC GCTACTTTCG CGATCTCGAC ACGGCGGAGG AAGCCTTTCA GGACGCCTGC
CTGCGCGCGC TCCGCAATTG GCCCCAGAAC GGGCCGCCGC GCGACCCCAC GGCCTGGCTG
ATCCTCGTCG GCCGCAACGC CGCGCTCGAC GGCGTTCGCA AGCGGGCGAA GCTCGCCCCG
CTCCCGCCCG ACGAGGCGAT CTCGGATCTC GGCGACGCCG AAGCCGAACT CGCCGAGCGG
CTCGACGGCG CCCATTACCG CGACGACGTG CTGCGTCTTC TGTTCGTCTG CTGCGCGGAC
AGTCTACCGC CCGGCCAGCA AATCGCGCTG GCGCTGCGGA TCGTTTCGGG GCTCAGCGTG
CGCCAGATCG CGCGCGCCTT TCTTACCAGC GAAGGCGCGA TGGAGCAGCG CATCACCCGC
GCCAAGGCGA AGATTGCGCG CTCCGAGATC CCATTCGAGG CGCCCGGCCC GGTGGAGCGC
GCCGAGCGGC TCGGCGCCGT CGCCGCGATG ATCTACCTCG TCTTCAACGA AGGCTATTCG
TCCGGCGAAT CCTCCGTTCG CGGGTGCCTG TGCGAAGAGG CGATCAGGCT GGCGCGGCTG
CTGTTGCGTC TGTTCCAGAC CGAGCCCGAA ATCATGGGCC TGACGGCCCT GATGCTGCTC
CAGCACGCCC GCGCCGCAGC GCGCTTCGAC GAGGCCGGCG CAATCATCCT TCTCGACGAT
CAGGATCGGA GCCTCTGGGA CGGCAAGCTG ATCGCCGAAG GGCTGGCGCT GATCGACAAG
GCGCTGCGCC ATCGCCAGCC GGGGGCCTAC CAGATACAGG CCGCGATCGC CGCGACGCAC
GCCCGCGCCA ATCGGCCGGA AGAGACCGAT TGGAGGCAGA TCGACGCGCT TTATGCTGCG
CTTGAACGGC TGCAGCCCTC CCCGGTCGTC ACGCTCAACC GGGCAGTCGC CGTCAGCAAG
ACGCGTGGAC CAGCCGCGGC GCTGGAGATG ATCGCGCCGC TGGCGGCGCA GCTTTCCGGA
TATTTCTACT TCTTCGGAGC AAAGGGCGCG TTTCTTTCTG AGCTCGGCCG CAGGGAGGAA
GCGCGCGCGG CGTTTACCCA GGCGATCGCG CTCGCCAACT CGCCCGCGGA AGCCGCCCAT
ATCCGCATGC ATCTCGATCG CCTGACGCAG GCGACTGAGG CTTCGCCATA A
 
Protein sequence
MNELTWINAA LTAARPQAIA ALMRYFRDLD TAEEAFQDAC LRALRNWPQN GPPRDPTAWL 
ILVGRNAALD GVRKRAKLAP LPPDEAISDL GDAEAELAER LDGAHYRDDV LRLLFVCCAD
SLPPGQQIAL ALRIVSGLSV RQIARAFLTS EGAMEQRITR AKAKIARSEI PFEAPGPVER
AERLGAVAAM IYLVFNEGYS SGESSVRGCL CEEAIRLARL LLRLFQTEPE IMGLTALMLL
QHARAAARFD EAGAIILLDD QDRSLWDGKL IAEGLALIDK ALRHRQPGAY QIQAAIAATH
ARANRPEETD WRQIDALYAA LERLQPSPVV TLNRAVAVSK TRGPAAALEM IAPLAAQLSG
YFYFFGAKGA FLSELGRREE ARAAFTQAIA LANSPAEAAH IRMHLDRLTQ ATEASP