Gene Msil_2474 details


Gene Information

Locus tag: Msil_2474
Symbol:
ID: 7091026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2706581
End bp: 2708260
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 65%
IMG OID: 643465795
Product: histidine kinase
Protein accession: YP_002362765
Protein GI: 217978618
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
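
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2706581
end_bp = 2708260

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.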


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.369133
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCGA ACGATAAAGT CAACATCCTG CTCGTCGACG ATCAGCCCGG CAAACTGCTG 
AGCTATGAAG CGATTCTCGA CGAGCTTCAG GAAAATCTCA TCAAGGCGGC CTCGGCGCGC
GAGGCCTTCG CGCATCTTCT CAAGACCGAG GTCGCGGTTA TATTGATCGA CGTCTGCATG
CCGGAGCTCG ACGGCTACGA GCTTGCGGGG ATGATCCGCG AACATCCGCG GTTCGAGACG
GTCGCGATCA TTTTCGTGTC GGCGATCCAG ATCACCGATC CCGACCGGCT GCGCGGCTAT
GAGGCCGGCG CCGTCGACTA TGTGCAGGTT CCTGTGGTTC CCGAAGTGTT GCGCGCCCAA
GTCAAGGTGT TCGCCGAACT GCACCGCAAG ACGCGCCAGC TTGAACGCCT CAATGAAGAA
CTCGAGGAGC GCGTCGCCGA GCGTACGGCG GCGCTCGAAC AATCGAGCGC GGAGCTGAAG
CGGCTCAATC AGGATCTCGA GACGCGCATC GACGCCCGCA CGCGCGAGCG CGAGCAGGCG
CTGGCGCAGT TGTTCGAGGC GCAGAAAGTG GACGCCATCG GACAGCTGAC CGGCGGCGTC
GCGCATGATT TCAACAATCT CCTGACGGCC ATCATGGGAA GCCTCGAGCT TTTGAAAAAG
CGTCTGCCCG ACGAGCCGCG CATCAGCCGC CTTGTCGACA CCGCCATGCA GGGCGCCGAG
CGCGGCGCGG CGCTGACGCA GCGTTTGCTC GCGTTCTCGC GTCGTCAGGA GCTAAAGCCC
GAAGCAGTCG ACGTCGCGCA GCTTGTCAAC GGCATGGAAG AGCTGCTCAG GCGCGCGCTC
GGCCCCGGCG TCAGCATCGA AAAGCGCTTG CCGGATGATC TTCGGTTCGC GCGCGTCGAC
GGCAATCAGC TCGAACTCGC CCTCATCAAC CTTGCCGTCA ATGCGCGCGA CGCGATGCCC
TCGGGCGGCG CCATCACCAT CGCCGCCGCA AACGAAATCG TCGCCGCGCA AAGCCCCGGC
AAGGCGATCA GCCCCGGCGC CTATGTGCGC ATCAGCGTTA TCGATGAAGG GCAGGGCATG
GACGCGGCGA CGCTCGCCAA GGCGGCCGAT CCTTTTTTCA CGACCAAAGG GCCAAGCAAG
GGCACGGGCC TCGGACTTTC GATGGTGCAG GGTCTTGCGG TTCAATCCGG CGGGGCGATC
GAAATATCGA GCCGCGTCGG CGCCGGCACC ACCGTAGAAC TCTGGCTGCC GCAGGCGGAA
TTCGACGAGC CGCGCAAACG CGCCGCGCCG GACGCGCGCA AAAGCTGGCG CGCCGATCTC
GCGCCCTGCA CGGTTCTCCT CGTCGACGAC GATCTCCTGG TCAGCGCGGG CGCCTCCTCG
ATCCTCGAAG ATCTGGGCCA CAGCGTCATC GAGGCTCATT CCGGGGCCGA GGCTCTGCGG
CTTCTGCAGA ACGGCAGCGC GCCCGATCTC GTCATCACCG ACTACGCCAT GCCGGGCATG
ACCGGGCTTG AGCTCGCCCG CGCCATCCGG GCCAGCTATC CGGCGCTGCC CATCGTGCTC
GCGAGCGGCT ATGCCGAACT GCCGCATCTT GGCCTCGACG AGAAGCCGCT GCCGCGCCTC
GCCAAACCCT TCCGGCAGGA TGAGTTGCTG GCGGCGATGG CCGAGGCCTC CGGCCGCTGA
 
Protein sequence
MAANDKVNIL LVDDQPGKLL SYEAILDELQ ENLIKAASAR EAFAHLLKTE VAVILIDVCM 
PELDGYELAG MIREHPRFET VAIIFVSAIQ ITDPDRLRGY EAGAVDYVQV PVVPEVLRAQ
VKVFAELHRK TRQLERLNEE LEERVAERTA ALEQSSAELK RLNQDLETRI DARTREREQA
LAQLFEAQKV DAIGQLTGGV AHDFNNLLTA IMGSLELLKK RLPDEPRISR LVDTAMQGAE
RGAALTQRLL AFSRRQELKP EAVDVAQLVN GMEELLRRAL GPGVSIEKRL PDDLRFARVD
GNQLELALIN LAVNARDAMP SGGAITIAAA NEIVAAQSPG KAISPGAYVR ISVIDEGQGM
DAATLAKAAD PFFTTKGPSK GTGLGLSMVQ GLAVQSGGAI EISSRVGAGT TVELWLPQAE
FDEPRKRAAP DARKSWRADL APCTVLLVDD DLLVSAGASS ILEDLGHSVI EAHSGAEALR
LLQNGSAPDL VITDYAMPGM TGLELARAIR ASYPALPIVL ASGYAELPHL GLDEKPLPRL
AKPFRQDELL AAMAEASGR
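
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. The following is a minimal sketch, not the database's own tooling: it assumes the amino acid assignments of NCBI translation table 11 (which match the standard code for all 64 codons), strips the spaces used to group the sequence into blocks of ten, and stops at the first stop codon. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and introduced only for this example.

```python
from itertools import product

# Compact codon table in the usual NCBI ordering (bases T, C, A, G).
# Table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-letter grouping and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCCGCGA ACGATAAAGT CAACATCCTG CTCGTCGACG ATCAGCCCGG CAAACTGCTG"
last_line = "GCCAAACCCT TCCGGCAGGA TGAGTTGCTG GCGGCGATGG CCGAGGCCTC CGGCCGCTGA"

print(translate(first_line))
print(translate(last_line))
```

Each full line of the gene sequence holds 60 bases, a multiple of three, so every line starts in frame and can be translated on its own; the two printed fragments correspond to the start and end of the protein sequence listed above. Passing the whole gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.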