Gene Msil_1131 details


Gene Information

Locus tag: Msil_1131
Symbol:
ID: 7093894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1219095
End bp: 1220123
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 60%
IMG OID: 643464472
Product: transcriptional regulator, AraC family
Protein accession: YP_002361462
Protein GI: 217977315
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
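
The coordinates and lengths listed above are mutually consistent, and a short sketch can make the arithmetic explicit. It assumes 1-based inclusive coordinates (an assumption, though it matches the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 1219095
END_BP = 1220123


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1029 342
```

Both results agree with the Gene Length and Protein Length fields of this record.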


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.079056
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCCT ATGCGCTCAC GAGAGTCAGC ACGATCGGCC CTGTCGCCGA CGAGATTGAG 
CGCGCCGGCG GCTCTCTTGC GCGAGTTTTT CGTCGCGCCG AGTTGCCCAT AAAACTCATC
GAAAAACCAA ATCAGCTGAT CCTGCTGCGA GACCAGTTCA GGCTGATCGA AAACGCCGCG
CGCGAGATTG GCGACGAGTC GCTTCCGGCG CGACTTTCAA TTCATGCGGG ACTTGCCGGA
CTAGGACCCT ACGGGCGTCA TTTCATGTCG TTTTCACGTT TGGGCGCGGC CATTTCAGAA
GGGGTTGGCG CTTTCGCCGA ATTGCTCCAG GCGGCGACGC GGATGCAGCT GGCCGTGAAC
GGGCGTTGGG CGAAATGGAG CTATTCCATC ACGGAGCCGA TCGACGTCGG CCGCCAGAAG
AACGAATTGC TGGCACTTGG CTATATGATC GATCTTCTCC GTTGCTATGC GGTCAAAGGA
TGGGCGCCGG ATCATCTCGA ATTGCCGGGC GCGACGTTGC AGGCGAAGGC CGATGTTGAG
GCCATTTTCG GATGCAGTCT GAAATCAGGG CCGGCGGCCG CCGTGATATT TCCGGCCGAA
CTTCTTCAAT TGCCGAATCC GGGTTCGGCG CGGAAAGCGC CAGGCGACGA TTTTTTGTTT
ATTCCGCCGG AGGACGACTT TCTTTCCTGC GTGTCTCACC TGATTCAGTG CGAAGCGCTC
GCGGGACGCC CGAGCATCGA TCGGATCGCG CGGCGTCTCG GTTTGCCCCG GCGCACGCTG
CAGCGCCGGA TGGAAGAACG CGGCGCGACT TTCGAATCGA TGTTGCGCGC TACGTTGCTG
CGGCAAGCCT CTGCTTTTCT GAAGGAGCCC GGCCTCTCCA TCACCGATCT TGCGTTTGAA
CTCGGCTATT CCGATCCGGC GCATTTTACG CGCGCATTCC GAAGCTGGAC GGGAATGTCT
CCGCGGGAGT GGCGCAGTCG TCTGCAAGAG TCGCATGCTC CGAGACCGGA GGCTTTATCG
AACGGATAA
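
A GC percentage like the one listed for this gene can be computed directly from a sequence in this block layout. The sketch below is a minimal illustration; `gc_percent` is a hypothetical helper, and how the source database computes or rounds its own figure is not stated in this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input in the same space-separated block layout (not the full gene).
print(gc_percent("GGCC ATAT"))  # 50.0
```

Pasting the full gene sequence above into `gc_percent` as a multi-line string would give the unrounded value for this gene.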
 
Protein sequence
MTAYALTRVS TIGPVADEIE RAGGSLARVF RRAELPIKLI EKPNQLILLR DQFRLIENAA 
REIGDESLPA RLSIHAGLAG LGPYGRHFMS FSRLGAAISE GVGAFAELLQ AATRMQLAVN
GRWAKWSYSI TEPIDVGRQK NELLALGYMI DLLRCYAVKG WAPDHLELPG ATLQAKADVE
AIFGCSLKSG PAAAVIFPAE LLQLPNPGSA RKAPGDDFLF IPPEDDFLSC VSHLIQCEAL
AGRPSIDRIA RRLGLPRRTL QRRMEERGAT FESMLRATLL RQASAFLKEP GLSITDLAFE
LGYSDPAHFT RAFRSWTGMS PREWRSRLQE SHAPRPEALS NG
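
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a minimal illustration rather than the database's own pipeline: it builds the codon table by hand, relying on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. Alternative start codons are not handled, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 shares these
# assignments with the standard code ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGACGGCCT ATGCGCTCAC GAGAGTCAGC ACGATCGGCC CTGTCGCCGA CGAGATTGAG"
print(translate(first_line))  # MTAYALTRVSTIGPVADEIE
```

The output matches the first two blocks of the protein sequence, and the final nine bases of the gene (AACGGATAA) translate to NG followed by a stop codon, matching the end of the protein.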