Gene Msil_3788 details

Gene Information

Locus tag: Msil_3788
Symbol:
ID: 7090716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 4148968
End bp: 4150401
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 66%
IMG OID: 643467073
Product: transcriptional regulator, XRE family
Protein accession: YP_002364032
Protein GI: 217979885
COG category: [R] General function prediction only
COG ID: [COG3800] Predicted transcriptional regulator
TIGRFAM ID:
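
The coordinate and length fields in this record are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_len: int) -> int:
    """Amino-acid count for a CDS, assuming the last codon is a stop codon."""
    if cds_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_len // 3 - 1


gene_len = cds_length(4148968, 4150401)
print(gene_len)                  # 1434, matching "Gene Length"
print(protein_length(gene_len))  # 477, matching "Protein Length"
```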


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.432624
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAT TGTTCGCCGG CCAGATCCTG AGGCGCCGCC GCGAGGAGCT CTCCCTCCCG 
CAAGCGCTCG TCGCCAGGCG CCTCGGGATC TCGCCAAGCT ATCTGAACCA GATCGAAAGC
GATCAGCGCC CGCTGACCGC AGCGATCCTC GTCGAGGTCA CCCGCGTCCT GAAGCTGCAG
GTCGCCGATC TTTACGACGA TGGGCCGGAA CGACTCGCCG CCAATCTGCG CGAGATGCTG
TCCGATCCCC TGTTCGAGCA CGCGAGCGTC AGCGGCCGCG AACTGAAGGC GATCTCGGCC
GCAGCGCCCC ATCTCGTTCG CGCCATGCTC GATCTGCATT CCTCCTATCG CCGCATGGAG
GAGCGCTATC GCGGGCTCGA CGACGCCCTG CGCCAGGGCG AGGGCCCGAT CGAGGCGCAA
AGGCGTCCCT TCGCCTTCGA CGAAGTGCGG GATTTCTTCC ATTTCATGGG CAATTACTGC
GATGCGCTGG ACCGCAGCGC CGAGGCGCTG GCCGAGCGGC TTTGGGGCGA GAACGCCGTC
TCCTATCAGA GCCTTGCCGA CTATTCCGCG CGCGAGCTCG GCGCGCGCGT GATCGTTTCG
ACTCCGAAGA CGGCCGGCGG CCCGATCAGC CGCTACGACC CTGCCATCGG CGCCCTGTTC
CTTGACGGCG ATCAGGAGCC GGCGACGCAG AAATTCCTGC TCGCCTGCCA CATCGCGGCG
CGCACGCAGC ACGAAAAAAT CGACGAGCTC GCCGGCGCCG CGCGGTTCAA GAGCCCGGCC
TCGGCGGAGA TCGCCAAACT CGCGCTCGCC AATTATTTCG CCAGCGCGCT TGTGATGCCC
TATGCGCGCT TTTCCGGCGA AGCGCGCGCG GCCCGCCATG ACGTCGAGCG CCTGTCGCGG
CAGTTCGGCG TCAGCATCGA ACAGGTCTGC CACCGTTTCT CGACGCTGCA GCGACCCGGC
CGCGAGGGCG CGCCCTTCTA TTTTCTGCGG GTCGACCGGG CCGGCAACAT CACCAAACGC
CACAGCGCGA CGCGTTTCCA GTTCGCCCGC TATGGCGGCG CCTGTCCCTT GTGGAACATC
CATGAGGCGT TCGAGCGGCC CAACCACTTC CTCGTGCAGG TCGCCGAAAT GCCGGATGGC
GTGCGCTATC TTTCGGTGGC GCGCTCGATC GCCAAGAAGG GTGGCTCGTT CGGCTCGCCG
CAGCGCAATT ACGCGATCGG CTTTGGCTGC GAGATCGATC ATGCGCAAGA TCTCGTCTAT
TCCGACGCGA TCGATCTTCG CAGCTCCGCA ACGGTCGCCA AGATCGGCGT CGCCTGCCGC
GTCTGCGAAC GGCAAGACTG CCTGCAGCGC GCCGTGCCGC CGCTCGACGC GGCAATCGAG
GTCGACGCCA ACGAACGCGG CATGGTGCCC TACCGGCTCA GAAGCGGCGG GTGA
 
Protein sequence
MKKLFAGQIL RRRREELSLP QALVARRLGI SPSYLNQIES DQRPLTAAIL VEVTRVLKLQ 
VADLYDDGPE RLAANLREML SDPLFEHASV SGRELKAISA AAPHLVRAML DLHSSYRRME
ERYRGLDDAL RQGEGPIEAQ RRPFAFDEVR DFFHFMGNYC DALDRSAEAL AERLWGENAV
SYQSLADYSA RELGARVIVS TPKTAGGPIS RYDPAIGALF LDGDQEPATQ KFLLACHIAA
RTQHEKIDEL AGAARFKSPA SAEIAKLALA NYFASALVMP YARFSGEARA ARHDVERLSR
QFGVSIEQVC HRFSTLQRPG REGAPFYFLR VDRAGNITKR HSATRFQFAR YGGACPLWNI
HEAFERPNHF LVQVAEMPDG VRYLSVARSI AKKGGSFGSP QRNYAIGFGC EIDHAQDLVY
SDAIDLRSSA TVAKIGVACR VCERQDCLQR AVPPLDAAIE VDANERGMVP YRLRSGG
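
The sequences in this record are laid out in space-separated blocks of 10 residues, so they need to be joined before any analysis. The sketch below is one way to do that and to compute a GC fraction for a nucleotide sequence. The helper names are hypothetical, and the example input is only the first 30 bases of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence in this record.
example = "ATGAAGAAAT TGTTCGCCGG\nCCAGATCCTG"
seq = clean_sequence(example)
print(len(seq))          # 30
print(gc_fraction(seq))  # 0.5 for this 30-base prefix only
```

Applying `clean_sequence` to the full gene and protein blocks should give strings whose lengths can be compared with the Gene Length and Protein Length fields.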