Gene Msil_1049 details

Gene Information

Locus tag: Msil_1049
Symbol:
ID: 7091877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1136128
End bp: 1137180
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 68%
IMG OID: 643464388
Product: transcriptional regulator, AraC family
Protein accession: YP_002361380
Protein GI: 217977233
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
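
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (an assumption, though it is the reading that reproduces the listed 1053 bp) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1136128, 1137180)
print(length_bp)                  # 1053
print(protein_length(length_bp))  # 350
```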


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.049947
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGACA GCTCTTACGC CGCCAGGGCC GAAACGCTCG ATTCTTCGAA GACTGCTGTT 
CCGCATTTTT CCTTTTCTAC CAAAGATCTC CCGTCCACTG ACCAATTCGG CGCGTGGCGC
GAATTTATGT CGTCGACCGT CGAAATCCAG CGGCTCGAAG GGAAAGAGCA GGGATTTGCG
GCGGATCAAC AGGTGTGGAG CCTTGGCGCC TTCGCGCTGA CGCACGCCCT GATGCCGGGC
GAGGGACATG CGCGCGCCTG GCGCCATCTT GGCAAGGATC CGATCGATCA CTGGTGCCTC
GTCGTCGTCC GCGACGCCGG CCGCGATGGC TCGAGGCTGA TCGGACTGCG GTCGCTCGGG
CGCCCGTTCG AAGGGGCGGC GGCTGACCGC GACGTGCTGT CCCTGTTCGC GCCGCGAAGC
CTGTTTCGCG GCCTCTCCAG CCTGCTCGAC GCCGCGCCCG ACGTCATCGC CGACGTCGGC
CTCGGCGCGA TTCTCGCCGA TTATCTGCTC TCATTGCAGC GCAGGCTGCC CGGCGTAACC
GAGGCCGACG CGCCGCAAAT CGTCGAAGCG ACGCGGGCGA TGATCGCCGC CTGCCTGACG
CCGGCGGCCG ATTTGCGCGC CGCGGCGGAG GGCGCCATCG CTGCGACTGT GCTTGAGCGG
GCCGATGCGA TTATTTCGGC CAATCTCGCC GCCAAGGACC TCGGCCCCGA ATTTCTCTGC
CGCGCCCTCG GCGTCTCCCG CTCGCGGCTC TACCGGCTGT TCGAGCCGAC GGGCGGGGTC
AGTCGGGCGA TCCAGCGCGC GCGGCTGATC CGGGCGCAGG ACGCCTTGCG CGATCCGGCC
GACGGGCGCC CGATTGTCGT CATTGCCGAC GCGCTGGGTT TTGCCGATCC GTCGAGCTTC
AGCCGGTCCT TCCGTCGAGA ATTCGGCCAT AGTCCCAGCG ACGCGCGGAG CGCCGGGGCG
CTCGGCTTTT TAGCCCCGCT GGCCGCCGCC AAACCCCTCG TTTGCGCCCC GATCGATCGT
CTCGGCGACG TCCTTCGCAG CCTTCATGCC TGA
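
The GC content reported in the Gene Information section can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted in the 10-base blocks shown here; `gc_fraction` is an illustrative name rather than a function from any existing tool.

```python
def gc_fraction(blocked_seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Only the first line of the gene sequence is used here for brevity;
# pass the full sequence to compare against the listed GC content.
first_line = "ATGGATGACA GCTCTTACGC CGCCAGGGCC GAAACGCTCG ATTCTTCGAA GACTGCTGTT"
print(f"{gc_fraction(first_line):.0%}")
```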
 
Protein sequence
MDDSSYAARA ETLDSSKTAV PHFSFSTKDL PSTDQFGAWR EFMSSTVEIQ RLEGKEQGFA 
ADQQVWSLGA FALTHALMPG EGHARAWRHL GKDPIDHWCL VVVRDAGRDG SRLIGLRSLG
RPFEGAAADR DVLSLFAPRS LFRGLSSLLD AAPDVIADVG LGAILADYLL SLQRRLPGVT
EADAPQIVEA TRAMIAACLT PAADLRAAAE GAIAATVLER ADAIISANLA AKDLGPEFLC
RALGVSRSRL YRLFEPTGGV SRAIQRARLI RAQDALRDPA DGRPIVVIAD ALGFADPSSF
SRSFRREFGH SPSDARSAGA LGFLAPLAAA KPLVCAPIDR LGDVLRSLHA
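
The protein sequence above corresponds to the gene sequence translated under translation table 11. For sense and stop codons, table 11 makes the same assignments as the standard genetic code; the two differ in the permitted start codons, which does not matter here because this gene begins with ATG. The sketch below builds the codon table in plain Python as a rough consistency check. The name `translate` and the table layout are illustrative choices, not the method used by the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_seq: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon.

    Spaces and line breaks are ignored. Codons containing characters
    other than A, C, G, T raise KeyError.
    """
    seq = "".join(blocked_seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGATGACA GCTCTTACGC CGCCAGGGCC"))  # MDDSSYAARA
```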