Gene M446_3914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3914 
Symbol 
ID6132596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4361103 
End bp4364171 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content73% 
IMG OID641644072 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001770714 
Protein GI170742059 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.1 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0660292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGCCT GCAGCCTGCC GCCCGAGCCG CCCGACCACC CCTTCATCCC GGGTGGCAGC 
GAACTCGGCG CGCTCGTGCG CGCCCACGAT TGGGCGGCGA CTCCCCTCGG GCCCCTCGAA
ACCTGGCCCC AGAGCCTGCG CACCGCGGTC GGGATCGTGC TGCTCTCGCC CGTGCCCATC
GTGATGCTGT GGGGCGAGGA CGGCATCATG ATCTACAACG ACGCCTACTC GGTCTTCGCC
GGGGGACGGC ACCCGCAGCT CCTCGGCTCC AAGGTGCGCG AGGGCTGGCC CGAGGTCGCC
GATTTCAACG ACCACGTCAT GAAGGTGGGG CTCTCCGGCG GCACGCTCGC CTACAAGAAC
CAGGAACTGA CCCTGCACCG CTACGGCCGG CCCGAGCAGG TCTGGATGGA CCTGGACTAC
TCGCCGATCC TCGACGAGCG CGGCCGGCCC GCCGGGGTCG TCGCCATCGT GGTCGAGACC
AGCGAGCGCG TGCGGGCGGA GCGCCGCGAG GCCTTCCTGG CCGGGCTCGC CGACGCGCTG
CGCGACCTCG CCGACCCGCT CGCCGTGCGG GAGGCCGCGA CCCGCAGCCT CGGCCTCCAT
CTCGGCGCGG ACCGCGTCGG CTACGGCGAG GTGAGCGACG CGGACGGCGC CCCGCGCCTG
ACCATCGCGC AGGATTGGTG CGCGCAGGGC GTCGCCTCCA TCGCCGGCCA GCACGACATG
ACCCGCTACG CCTCCGCCTT CCTGGCGGAT TTCCTGGCCG GCCGGACGGT CGTGTTCGAG
GACATGCGCA GCGATCCCCG GACCGCCGGC CAATCCTCGG AGGCGGCCCA CGCGCGCCTC
GCCGTCCGGG CGCAGATCGT CGTGCCGCTG GTCAAGGCCG GGCGCCTCTC GGCCGTCCTC
TTCGTCCACT CGGTCGAGGC CCGGCCCTGG TCGGCCGACG ACGTGTCGCT GGTCGGGGAG
GTGGCCGAGC GCACCTGGGC CGCCGTCGAG CGGGCCCGGG CCGAGCGCAT GGTCCAGGAG
CGCAACGCGC GGCTGGAGAT CCTCGCCGAG GCGATCGAGC GCGCGCCCGC GGCGCGCACG
CTGGGCGAGC TGATGGAGAT CGTGGGGGCC GCGGCGCGGC GCCTGTCGGG GGCGGACGGC
GTCACGGTCG TGCTGCGCCG GGGCGAGCAG TGCTTCTACG CCATCGAGGA CGCGGTGCAG
CCGCTCTGGA AGGGACGCCG CTTCCCGCTC GTCTCCTGCA TCTCGGGCTG GGCCATGCTC
AACCGCCGGA CCGCGATCGT GGCGGATGTG CGCGCGGATT CCCGCGTGCC GCAGGATCTC
TACACGCCCA CCTTCGTGCG CAGTCTCGTG ATGGTGCCGA TCCTGGCGGA CGGCGAGGCG
ACCGCCGCCA TCGGCGCCTA CTGGCCGGAG GTCCATCGGC CGCCCGAGGG CGAGGTCGCG
ACGCTGGAGG CGCTCGCGCG CACGGCGGGC GCGGTGCTGC GGCGCCTCGA CGCCGAGGAG
GCGCTGCGCC GCCTCAACGA GACCCTGGAG ACGCAGGTCG CCGAGCGCAC CGCCGACCGG
GACCGCATGT GGCGGCTCTC GACCGACGTG ATGCTGGTGG CCCGCTTCGA CGCCACGATC
ACGGCCGTGA ACCCGGCCTG GACGACCCTG TTCGGCTGGC GCGAGGACGA CCTCGTGGGC
GGCCGGTTCA CGGATTTCGT CCACCCGGAG GACAGGGAGG CGACGCGGGA GGAGGTCGGC
CGGCTCTCCG AGGGGCTGAC GACGCTGCGC TTCGTCAACC GCTACCGGCA CCGGGACGGC
AGCTACCGCT GGCTGTCCTG GACCGCGGTG CCGGACGAGG GGCTGATCCA CGCGGTCGGC
CGGGACATCA CCGTCCAGCG CGCCCAGGGC GAGGCCCTGG CCAAGACCGA GGAGGCGCTC
CGGCAGGCTC AGAAGATGGA GGCGGTGGGC CAGCTCACCG GCGGCCTCGC CCACGACTTC
AACAACCTGC TCACCGGCAT CGCCGGCTCC CTCGAATTGC TGCAGACGCG CGTGGCGCAG
GGCCGGACGG GCGAGCTCGA CCGCTACATC GAGGCGGCGC AGGGGGCCGC CAGGCGCGCC
GCGGCGCTCA CTCACCGGCT CCTCGCCTTC TCGCGGCGCC AGACCCTCGC CCCCAAGCCC
ACCGACGTGA ACCGGCTCGT GGCGGGCATG GAGGAGCTGA TCCGGCGCAG CATCGGCCCG
GCGATCGCCC TGGAGATCCT GGCCGGGGAC GGCCTCTGGC CGATCCTGGT CGATCCGAGC
CAGCTCGAGA ACGCGCTGCT GAACCTGTGC ATCAATGCCC GCGACGCGAT GCCGGACGGG
GGCCGCATCA CGATCGAGAC CTCCAATGCC TGGCTCGACG AGGAGGCGGC CCGCCCGCTC
GATGTCCCGG CGGGCGAGTA CCTGCTCCTG TGCGTGACGG ATACCGGCAC CGGCATGCCG
CCCGACGTGA TGACCAAGGT CTTCGACCCG TTCTTCACGA CGAAGCCCCT CGGCGAGGGC
ACGGGGCTCG GCCTGTCGAT GATCTACGGC TTCGTCCGGC AATCCGGCGG GCAGGTGCGG
ATCGCCTCCA CCCTCGGCCA GGGCACGACG ATGGGCCTCT ACCTGCCGCG GCACCGCGGC
GAGGCGGAGG CGCCCGAAGC CTCGCCCGCC CTCGCCGTCG CCCCCCGGGC CGGGCAGGGC
GAGACGGTGC TGATCGTCGA CGACGAGCCG AGCGTGCGCA TGCTCGTCAC CGAGGTGCTG
GAGGATCTCG GCTACGTCGC CATCGAGGCG GCGGACGGGC CGTCGGGCCT GAAGGTGCTG
CAATCCGGCG CGCGCATCGA CCTCCTGATC ACCGATGTGG GGCTGCCCGG CGGCATGAAC
GGCCGGCAGG TCGCCGACGC GGCCCGGGTC ACCCGCCCGG ACCTCAGGGT GCTGTTCATC
ACGGGCTACG CGGAATCGGC GGCGATCGGC CGGGGGCTGC AGGAGGCCGG CATGGCGATC
CTGACCAAGC CGTTCGTCAT GGACACGCTC GCGAGCCGCA TCAAGGACCT CATCTCCGGC
CAGGGCTGA
 
Protein sequence
MNACSLPPEP PDHPFIPGGS ELGALVRAHD WAATPLGPLE TWPQSLRTAV GIVLLSPVPI 
VMLWGEDGIM IYNDAYSVFA GGRHPQLLGS KVREGWPEVA DFNDHVMKVG LSGGTLAYKN
QELTLHRYGR PEQVWMDLDY SPILDERGRP AGVVAIVVET SERVRAERRE AFLAGLADAL
RDLADPLAVR EAATRSLGLH LGADRVGYGE VSDADGAPRL TIAQDWCAQG VASIAGQHDM
TRYASAFLAD FLAGRTVVFE DMRSDPRTAG QSSEAAHARL AVRAQIVVPL VKAGRLSAVL
FVHSVEARPW SADDVSLVGE VAERTWAAVE RARAERMVQE RNARLEILAE AIERAPAART
LGELMEIVGA AARRLSGADG VTVVLRRGEQ CFYAIEDAVQ PLWKGRRFPL VSCISGWAML
NRRTAIVADV RADSRVPQDL YTPTFVRSLV MVPILADGEA TAAIGAYWPE VHRPPEGEVA
TLEALARTAG AVLRRLDAEE ALRRLNETLE TQVAERTADR DRMWRLSTDV MLVARFDATI
TAVNPAWTTL FGWREDDLVG GRFTDFVHPE DREATREEVG RLSEGLTTLR FVNRYRHRDG
SYRWLSWTAV PDEGLIHAVG RDITVQRAQG EALAKTEEAL RQAQKMEAVG QLTGGLAHDF
NNLLTGIAGS LELLQTRVAQ GRTGELDRYI EAAQGAARRA AALTHRLLAF SRRQTLAPKP
TDVNRLVAGM EELIRRSIGP AIALEILAGD GLWPILVDPS QLENALLNLC INARDAMPDG
GRITIETSNA WLDEEAARPL DVPAGEYLLL CVTDTGTGMP PDVMTKVFDP FFTTKPLGEG
TGLGLSMIYG FVRQSGGQVR IASTLGQGTT MGLYLPRHRG EAEAPEASPA LAVAPRAGQG
ETVLIVDDEP SVRMLVTEVL EDLGYVAIEA ADGPSGLKVL QSGARIDLLI TDVGLPGGMN
GRQVADAARV TRPDLRVLFI TGYAESAAIG RGLQEAGMAI LTKPFVMDTL ASRIKDLISG
QG