Gene Msil_0840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0840 
Symbol 
ID7092698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp924808 
End bp925899 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content67% 
IMG OID643464177 
Producthistidine kinase 
Protein accessionYP_002361172 
Protein GI217977025 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.133727 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTCGA GCGCGCAGAC GGTCGACATG CCGGCGCGCG ATCGCGCCAT GGTGGCTGGG 
CTGATTGATG CGCTGCCGGA AGCAGCGATC GTCATCGACC AGAACGATCG CGTCATCGCC
GTCAATGCGC CGGCGCGCGC CCTGTTTCCG GCGCTGCGCC GCGACCTTTT ATTGGCGCGC
GGATTACGCG CGCCCGATGT ATTCGATTGC CTGGCGCGGG CGCGCGCCTC CGGCGATCAG
GAGCGCGCGA CCTGGCTCGA ACGCGTCCCG GTCGAGCGCT TTCTCGAACT GCATGTCGCC
GCCTGGGTTT GGCCGTCCGG CGCGCAATCG ATGCTCCTCA GCCTGCGCGA TCTGTCCGAA
TCGCGCCGCG TCGAACGGAT GCGGGTCGAT TTCGTCGCCA ACGCCAGCCA TGAACTGCGC
ACGCCGCTCG CCTCCTTGCT CGGTTTTGTC GAGACGTTGC AGGGACCGGC GCGGGACGAT
TCCGCGGCGC GCGGGAAATT TTTGAAGATC ATGGGCGAGC AGGCGCGGCG CATGACCCGC
CTTGTCGACG ATCTTCTATC GCTGTCGCGG ATCGAGCAGC ATCTTCATCT TCTGCCGCAA
AGCCCGGTCG ATATGGCGGC GATCGTACGC CACATTGCCG ACACGCTGAC GCCGCTGGCG
CAGGATTCAG AGGTCGAGTT CATCGTCGAA ACCGCCGGCG TCATCGTGCC GGGCGACCGC
GACGAATTGC TGCGGGTCGT CGAAAATCTG GTCGAGAACG CCATCAAATA TGGCGCGAGC
GAAGCTCCGG ACGGCCTGCG CAGGGTCGAG ATCGCGCTGG TCGCGCAGGA GCGCCAATGC
GTGCTCAGCG TGCGCGATTT TGGCGCCGGC ATCGCGCCCG AGCATCTGCC GCGGCTGACG
GAGCGCTTCT ATCGCGTCGA CGCCGGCGCA AGCCGGGCCA AAGGCGGCAC GGGGCTCGGC
CTCGCCATCG TTAAACATAT TGTCGCGCGC CACCGCGGCC GCCTCGCCAT CGATTCGCAG
CCGGGGCAGG GGACGACCTG CACCGTCACC CTGCCGCTGC AGCCGCCGGC GACCCGCAGC
AGGGCGGGAT GA
 
Protein sequence
MRSSAQTVDM PARDRAMVAG LIDALPEAAI VIDQNDRVIA VNAPARALFP ALRRDLLLAR 
GLRAPDVFDC LARARASGDQ ERATWLERVP VERFLELHVA AWVWPSGAQS MLLSLRDLSE
SRRVERMRVD FVANASHELR TPLASLLGFV ETLQGPARDD SAARGKFLKI MGEQARRMTR
LVDDLLSLSR IEQHLHLLPQ SPVDMAAIVR HIADTLTPLA QDSEVEFIVE TAGVIVPGDR
DELLRVVENL VENAIKYGAS EAPDGLRRVE IALVAQERQC VLSVRDFGAG IAPEHLPRLT
ERFYRVDAGA SRAKGGTGLG LAIVKHIVAR HRGRLAIDSQ PGQGTTCTVT LPLQPPATRS
RAG