Gene Msil_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2020 
Symbol 
ID7094218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2190654 
End bp2192969 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content61% 
IMG OID643465344 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_002362322 
Protein GI217978175 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGACG AGACGCCCGC TTTCGGCGCC GCAGACCTGT CAAACTGCGA CCGCGAGCCA 
ATCCATATTC CAGGGTCGAT TCAGCCCCAT GGCGCCTTGC TGGCGCTCGA CGCCGACAGT
CTGACGGTGG CGCAGGCAGG CGGCAATACA AGACAGCTCC TCGGCTTTGC GCCGCGCGAT
CTGATCGGCA AACCGATCGA AGACTGGCTC TCGGCCGGGC GGATCGGCCG GCTGCGGGAC
TTGTTGGATA ATGAGGGGGC GATGATCCGC CCCTTGCATG CTTTCCGCAT GAACGCCGTG
GGTGGCAACC GTGGCGTCGA CGTCACCGCG CATTTCAGCG ACGGATCGCT CATTCTCGAA
TTCGAGCCGG TCTATGAGGA CATTGTCGAT GATTCGCTGG CGCTCGTTCA GACCATGATC
CGCAGCGTGC AGGAGGCCGA TTCTGTCGCG GCTTTCTGCC AATCCGTCGC CGACGTCGTG
CGTCAGGCGA CGAATTTCGA CCGCGTCATG GTCTACCGCT TTCTGCCCGA CGGCAGCGGC
GCGGTCGACG CGGAAGCCAA AGACCCGGCG CTGGCGCCTT TCCTCGGATT GCACTACCCG
GCGTCCGACA TTCCAAAGCA GGCCCGCGAT CTCTATCTGC GCAACTGGAT CAGGCTGATC
GCCGACGCGC GCTATGAACC GGCCCCGCTC GAACCGGTAT TCGACGCGCA GGAACGGCGC
CCGCTCGATC TGAGCCAGAG CGCATTGCGC AGCGTCTCGC CAATTCATCT TGAATATCTG
GGAAATATGG GCGTCGCCGC GACCATGTCG CTGTCGATCA TTCTCGACGG CAAATTGTGG
GGCCTGGTGG CCTGCCATTC ACGGACGCCG CGTTTTGTCG CGCATCGCCT GCGCGTCGCG
CTCGAACTTT TCAGCCAGAT GGCTTCGTTC CTTCTCGAGA CCAAAATCAC TGCCGCCGAG
CTCGAGCTTA GATCCCGGTC CAAAGTCCTG CACGACAGAC TTTTGACGCA TCTTGCGGGG
GTTGGGGAAC TTGCGGACGC TCTGGAAAAG CTGCGTCCGA GCTTGCTCGA CATTATATCC
GCTGATGGAC TGGGGCTCTG GATCGAGGGC CGCTATACAC ATCTCGGACG GGCGCCGGAC
GCGGATCAGG CGGCCGGGCT CGTGGGCTGG CTAAACGAGA CCGCGGAGGA CGGCGTTTTC
CATACTTCGG CGCTGCCTAA GCTCTATGCG CCCGCCGTCG ACTTTGCAGA TGTGGCTAGC
GGAATCATCG CGCTTTCGGT GTCGAAGACT CCGCGTGATT ATGTCATCTG GTTCCGGCCG
GAAATCATCG AGACTGTGAC CTGGGCTGGC AACCCCGATA AGTCGGTGAG CGCCGAGCCC
GATGGGCAAC GCCTGTCGCC GCGAAAGAGC TTCGCCGCGT GGAAGCAGGA GGTCCGACTG
CAGTCGCGCC CGTGGAGCAG CGTCGCAATT CAGACCGCGC AGGCGCTCAG GGTTTCGCTG
CTCGAGGTTG TGCTGCGACG CGTCGACCAA ATTGCGCGCG AGCGTGAGAC TGCGCGCCAG
CGTCAGGACG CCCTGCTTGT GGCGCTTGAC CATCGGATCC GTCAATGGGA GACGACGGCG
CAGCAACTGA AGATCGAATC GGATCGCCGC GCCGTTGTAG AGGCCGAGCT GTCCGAAGTC
TTGCGCAGCA CGGTCATCAA TCAGGAAGCG GAACGACAGC GGATTGCGCG CGAATTACAC
GACAGTCTCG GGCAATATCT GACTGTAATG CAACTCGATC TCGACGGGAT CGGCCGTGAC
GTCGATTCAT CTCCAGCAGT GAAGCGACGC GTGGCGGATC TCAAGAACCT GACGGCGAAT
CTTGGCAAAG AGGTCAATCG CCTCGCCTGG GAAATCCGGC CGACGGCCCT GGACGATCTT
GGCCTTCAGA CGGCAATTCA GCAATTTTTA GAGGAATGGG GCCAGAAATC GGGGCTGCAG
TTTGATTTGC ACCTCGCCTT GAGCGATCGG CGACTACCGC CTATTGTCGA GACGACGCTC
TATCGCATTC TTCAGGAAGC GATCACGAAT GTCGTCAAGC ATTCACAGGC AAAAAAAGCG
GGCGTCATTC TGAAGGCGAC GTCGAGCGAG GCGATCATGA TTGTGGAGGA CGACGGCAGG
GGCTTTCTCT GGGATGACGT CGACTCGGCG ACAAAGCCCT CCTCCCGCCT TGGGCTTCTT
GGCGTGCGTG AGCGCTTGTC CCTTGTAGGA GGCAAACTGG AGATTGAAAC CAGCCCGGGG
CATGGCGCGA CCCTCTTCAT TCATGTTCCG CTCTAA
 
Protein sequence
MDDETPAFGA ADLSNCDREP IHIPGSIQPH GALLALDADS LTVAQAGGNT RQLLGFAPRD 
LIGKPIEDWL SAGRIGRLRD LLDNEGAMIR PLHAFRMNAV GGNRGVDVTA HFSDGSLILE
FEPVYEDIVD DSLALVQTMI RSVQEADSVA AFCQSVADVV RQATNFDRVM VYRFLPDGSG
AVDAEAKDPA LAPFLGLHYP ASDIPKQARD LYLRNWIRLI ADARYEPAPL EPVFDAQERR
PLDLSQSALR SVSPIHLEYL GNMGVAATMS LSIILDGKLW GLVACHSRTP RFVAHRLRVA
LELFSQMASF LLETKITAAE LELRSRSKVL HDRLLTHLAG VGELADALEK LRPSLLDIIS
ADGLGLWIEG RYTHLGRAPD ADQAAGLVGW LNETAEDGVF HTSALPKLYA PAVDFADVAS
GIIALSVSKT PRDYVIWFRP EIIETVTWAG NPDKSVSAEP DGQRLSPRKS FAAWKQEVRL
QSRPWSSVAI QTAQALRVSL LEVVLRRVDQ IARERETARQ RQDALLVALD HRIRQWETTA
QQLKIESDRR AVVEAELSEV LRSTVINQEA ERQRIARELH DSLGQYLTVM QLDLDGIGRD
VDSSPAVKRR VADLKNLTAN LGKEVNRLAW EIRPTALDDL GLQTAIQQFL EEWGQKSGLQ
FDLHLALSDR RLPPIVETTL YRILQEAITN VVKHSQAKKA GVILKATSSE AIMIVEDDGR
GFLWDDVDSA TKPSSRLGLL GVRERLSLVG GKLEIETSPG HGATLFIHVP L