Gene Msil_3158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3158 
Symbol 
ID7093818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3471953 
End bp3473371 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content67% 
IMG OID643466467 
Producttranscriptional regulator, XRE family 
Protein accessionYP_002363428 
Protein GI217979281 
COG category[R] General function prediction only 
COG ID[COG3800] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAATG TCCGCCCCGT CTTTATGGGG CCCCGCCTGC GCCGGCTGCG CCGGGATCTT 
GGCCTCACTC AGGCTAATAT GGCTACGGAT CTCGATATTT CAGCCTCTTA TGTCGCCTTG
CTGGAGCGCA ATCAGCGCCC CTTGACCGCC GACATGCTGC TGCGCCTCGC CCGCACTTAT
AAGATCGACA TGAGCGATCT CGCCGGAGAC AGCGCCGCCG AATATACCGG CCGTCTTCAA
ACCGTCCTGA AAGATCCGAT GTTCTCGGAC ATCGATCTGC CCCCGCTCGA AACCGAGGAC
GTCGCCACCA GCTACCCCGG CGTCACGGAA GCGCTGCTGC GGCTTTATTC GGCCTATAAG
GAAGAACAGC TGGCGCTGGC CGATCGCGGC GCCGAGAGCC GTGGCGGCGC AGACCGGGGC
GCGGATGCGC CGGACCCTGT CGCTGAAGCC AGGCGCTTTC TCGCTGCCCG CCGCAATAGT
TTTCCGGGCC TCGACAACGC CGCCGAGCGC CTCGCCCAGA CCGTGAGCGG ACGCGCCGGC
GTCATCGGCC ATTTGCGCGC CCGGCACCAT CTGGGCGTCC GGCGTCTGCC CTCCGAGGTC
ATGGTCGGTT CAACGCGGCG GCTCGACCGC CATCGCGACG AGATCCTGCT CGACGATTCG
CTGGACGCGG CAAGCCAGAC CTTCCAGCTG GCGCTTCAGT TGATCTATCT CGAAATGTCG
GACGAGATCG ACGCCGTGCT GCGGGAAGGC AGCTTCGCCA CGCAAAGCGG CGAGCGCCTG
ACGCGGCGGG CGCTGGCGAG CTACGCCGCC GCCGCTTTGA TCATGCCCTA TTCCGCCTTC
GCCAGGGCGG TCGAAGCGCG GCGCTACGAT GTCGAGGCGC TGGCGCGCCA GTTCGGCGCT
AGCTTCGAGC AGACCGCGCA TCGGCTGACC ACGTTGCAGA AGCCGGGGCA GGAGCGGGTG
CCGTTCTTTT TTATCCGGGT CGATCCGGCC GGCAATGTGT CGAAGCGGCT GGACGGCGCC
GGCTTTCCCT TTGCCCGCCA TGGCGGCGCC TGCCCGCTCT GGTCGATCCA CAATGTGTTC
CGCACGCCGC GCCAGATCGT CACCCAATGG CTGGAATTGC CCGACGGTCA GCGGTTCTTC
TCGATCGCCC GCACGGTGAC GGCCGGGGGA GGCGCCTATG GCGCGCAGCG CGTCGAGCGC
GCCATCGCGC TTGGCTGCGC CGCCGAACAC GCCGGCCAGC TGATTTATAC GCAGGACCAG
CCGGACTTCA GCGCCGTTGC GGCGACGCCA ATCGGCGTCA CCTGCCGTCT CTGCCACCGC
ACCAATTGCA CCGCGCGATC GGCGCCGCTG ATCGGCCGGC AGGTGCTCCC CGACGATTAC
CGTCGCGCCA GCGCGCCTTT CGGCTTTTCG GACAGTTGA
 
Protein sequence
MPNVRPVFMG PRLRRLRRDL GLTQANMATD LDISASYVAL LERNQRPLTA DMLLRLARTY 
KIDMSDLAGD SAAEYTGRLQ TVLKDPMFSD IDLPPLETED VATSYPGVTE ALLRLYSAYK
EEQLALADRG AESRGGADRG ADAPDPVAEA RRFLAARRNS FPGLDNAAER LAQTVSGRAG
VIGHLRARHH LGVRRLPSEV MVGSTRRLDR HRDEILLDDS LDAASQTFQL ALQLIYLEMS
DEIDAVLREG SFATQSGERL TRRALASYAA AALIMPYSAF ARAVEARRYD VEALARQFGA
SFEQTAHRLT TLQKPGQERV PFFFIRVDPA GNVSKRLDGA GFPFARHGGA CPLWSIHNVF
RTPRQIVTQW LELPDGQRFF SIARTVTAGG GAYGAQRVER AIALGCAAEH AGQLIYTQDQ
PDFSAVAATP IGVTCRLCHR TNCTARSAPL IGRQVLPDDY RRASAPFGFS DS