Gene Msil_3591 details


Gene Information

Locus tag: Msil_3591
Symbol:
ID: 7092450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 3949908
End bp: 3951860
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 60%
IMG OID: 643466880
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_002363839
Protein GI: 217979692
COG category: [T] Signal transduction mechanisms
COG ID: [COG2202] FOG: PAS/PAC domain
        [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
        [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
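
The coordinates and lengths listed above agree with one another, and a short sketch can check this. The function names are hypothetical. The sketch assumes inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3949908, 3951860)
print(length)                  # 1953
print(protein_length(length))  # 650
```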


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGG ACGAGCGATT GAGCGGATCG GAGGCCGAAG AGGGGCGGTT CCGACTCCTG 
CTCGACGCCG TAACCGACTA TGCGATCTAT ATGCTTGATC GTGATGGCGT CGTCACAAGC
TGGAACACGG GCGCGCGGCG GTTCAAAGGG TACGAAGAAT CCGAAATTCT CGGGCGGCAT
TTCTCGACGT TTTATACCGA GGAAGACCGC AAGGCTGGAT TGCCCGCCAG AGCGCTCGAG
ACTTCGGCCA GGGAAGGGAA ATTTGAAGCC GAGGGTTGGC GTCTGAAAAA GGACGGTTCG
CGCTTCTGGG CGCATGTCGT CATCGATCCC ATCCGTCAGT CGGGAAAACT TGTGGGATTC
GCCAAAGTCA CGCGCGATCT CACGGAACGG CGCGCCGCGG AGGCGGCGCT TCGGCGCACC
GAGCATATGT TTAAGCTGCT CGTTCACGGC GTGACGGACT ATTCAATCTA CATGCTCGAT
CTGGATGGGC GCGTGGCCAC CTGGAATCAG GGCGCACAAC GGATAAAGGG CTATCTGCCG
GACGAAATCA TAGGCGAACA TTTCTCGCGC TTCTATACAC AGGAAGATCT CGCTACGGGC
GAGCCGCGGC GCGCGCTTGA AACAGCATAC CGAGAGGGAC GATTCGAAAA GGAGGGATGG
CGGGTCCGAA AGGACGGCAG CCGTTTTTGG GCCAATGTCG TCATGGACGC CATCCGCGAC
GAAAGCGGCG CCGTGCTGGG TTTTGCTAAA ATTACGCGCG ATATCACGGA GCGGCGCGAC
GCCCAGCGCG CGTTGGAACT GGCGCGGGAG GCGTTGTTCC AGTCACAGAA ATTGGACGCC
ATAGGGCAGC TTACGGGCGG CGTGGCGCAC GACTTCAACA ATCTCCTGAT GGCGATCTTG
GGTAGTCTCG AGCTCGTTCA GAAGCGTCTT CCCGCCGATC CGAAAATCGC GTCCCTGATC
GATAATGCAA TCTTGGCGGC CCAGCGTGGC ACATCGCTGA CACAGCGCAT GCTCGCTTTT
GCGCGGCGCC AAGAGCTGGA GCTCGAGCCG GTGGACGTAT TGGCGCTCGT GCGTGGAATG
ACGGATCTTC TCCAGCGCTC CATTGGCCCC TCTGCGCCGA TCGAGGTGCG ATTTCCGCTG
GCGCTCGAGC CTGTTCAGGC GGACGCCAAT CAGTTGGAGC TGGCGCTGCT CAACCTCGTC
GTAAACGCGC GGGATGCGAT GCCGAACGGG GGCGCGATCA TCATTGCTGC CCGACAGGAA
GCGATCGTCG AGCAAGCCGC CGTGGGCCTC GCCCCGGGCC GCTATATTTG TCTGTCGGTC
CAGGATACGG GCGAAGGCAT GGATCAAGCG ACGCTGGATC GAGCAAAAGA ACCGTTTTTC
ACAACCAAGG GGGTCGGCAA GGGCACGGGT CTGGGATTGC CGATGGTTCA CGGCGTCGCC
GAGCAGTCGG GCGGGCGCCT GATTTTGAAG AGCCAGAAAG GCGCCGGCAC GACGGCAGAG
ATTTGGCTGC CTGCTGCGAC AACGGCGTTG CAGCCTTCGC TCGCCAAACT TACGCCTTCA
GAGAGTTGCC CGGTCACTCA TCCGCTAACG GTGTTGGCCG TCGATGATGA TCATCTTGTT
CTGACGAATA CCGCTGCGAT GTTGGAGGAT CTTGGGCACA AGGTCTTTAC CGCGCTGAGC
GCTGAGCAAG CCCTTAACGT TTTGAGATGC GAGAAAACGG TCGATCTTCT CATAACGGAT
CAGGCCATGC CGTTCATGAC AGGCACGCAA CTCACCGACG CAATTCGCGA AGAGCGGCGA
GATTTGCCTG TGATCTTGGC GACCGGCTAC GCCGAGTTTC CGCCAGGAAC GGCGGAGGAT
CTGCTACGGC TTGCAAAGCC ATTCGGTCAA ATGCAGCTCG CGCGCGCCAT CTCAAGGGTG
ATTGGGACGC GTGGCCAAAG GCGGGAGCCT TAA
 
Protein sequence
MNQDERLSGS EAEEGRFRLL LDAVTDYAIY MLDRDGVVTS WNTGARRFKG YEESEILGRH 
FSTFYTEEDR KAGLPARALE TSAREGKFEA EGWRLKKDGS RFWAHVVIDP IRQSGKLVGF
AKVTRDLTER RAAEAALRRT EHMFKLLVHG VTDYSIYMLD LDGRVATWNQ GAQRIKGYLP
DEIIGEHFSR FYTQEDLATG EPRRALETAY REGRFEKEGW RVRKDGSRFW ANVVMDAIRD
ESGAVLGFAK ITRDITERRD AQRALELARE ALFQSQKLDA IGQLTGGVAH DFNNLLMAIL
GSLELVQKRL PADPKIASLI DNAILAAQRG TSLTQRMLAF ARRQELELEP VDVLALVRGM
TDLLQRSIGP SAPIEVRFPL ALEPVQADAN QLELALLNLV VNARDAMPNG GAIIIAARQE
AIVEQAAVGL APGRYICLSV QDTGEGMDQA TLDRAKEPFF TTKGVGKGTG LGLPMVHGVA
EQSGGRLILK SQKGAGTTAE IWLPAATTAL QPSLAKLTPS ESCPVTHPLT VLAVDDDHLV
LTNTAAMLED LGHKVFTALS AEQALNVLRC EKTVDLLITD QAMPFMTGTQ LTDAIREERR
DLPVILATGY AEFPPGTAED LLRLAKPFGQ MQLARAISRV IGTRGQRREP