Gene M446_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_2081 
Symbol 
ID6134822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp2324414 
End bp2327113 
Gene Length2700 bp 
Protein Length899 aa 
Translation table11 
GC content73% 
IMG OID641642310 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001768978 
Protein GI170740323 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00980702 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGCCC AGGCCTGCGG CACGCCGATG GCCCATATCA ACTTCATCGA CGCGGAGCGG 
CAGTGGATCA AGGCGGCGGT CGGGCACGAC CGGCGGGAGA TGCCGCTCGA CCTCGGGTTC
TGCGCCGAGG TGCTGCGGGC ACCCGACGTG GTGGAGCTCA CCGACCTCGA CCGTGATCCC
GCCCGCGCCG CCAACCCGCT GGTGGCCGAG ACCCCGCACC TGCGCTTCTA CGCGGGCGTC
CCGCTCCTCA CGCCCGAGGG CTACGCCATC GGGACGCTCT GCGTGCTCGA CCGCGTGCCG
CGGGAGCTGA CCGAGCCGCA GCGCTTCATC CTGCGGGCGC TGGCCCGGAC CGTGATGGCG
CAGCTCGCCA TGCGCCGCAC CGACGCGGCC CTGCGGGTGA GCGAGGAGCG CTACCGCTCG
CTGTTCAACT CGATCGATGC CGGGTTCTGC CTGATCGAGG TGCTCTTCGG CGACGGGGGC
AAGGCGGAGG ATTTCCGCAT CCTGGAGGTG AACCCGGCCT TCGCGGGCCA GACCGGCCTC
GCCGACCCGG TCGGGCGCAA GATGCGCGAC CTCGCCCCGG AGAACGAGCC CTACTGGCCC
GAACTGTTCG GCGAGGTGGC GCGGACCGGC CGGCCCGTCC GCGTCGAGAA CGCCGCCAGG
GCCCTCGGGC GCTGGTACGA CGTGCATGCC TACCGGGTCG ACCAGCCGGA CCTCAACCGC
GTCGCCGTGC TGTTCAACGA CATCACCCCG CGCCGCGTCG GCGAACTCGC CCTGCGCGAC
AGCGAGGCGG AGCTGCGCCT CGTCGCCGAC GCGATGCCGG TGCTCATCGC GGTGATCGAC
CGGTCCTTCA CCTACCGCTT CGTGAACGCC GCCTACGAGA CCTGGTTCGG GCATTCCCGC
GACGCCGTGA TCGGCCGCCA CATGGGCGAC CTCATCGGCG AGCAGGAATT CGCGATCCGC
CGCCCCTCCG TGGAGCGGGC GCTGGCCGGC GAGGAGGTGC GCATCGAGCG CACCTGGCCC
TGGCCGGACG GGCGCACCCG CATCGCGGAC ATCCGCTACC TGCCCCGCCG CAGGCCGGGC
GGCGAGGTCG ACGGCGTCTA CGTGTTCGTC CACGACGTCA CCGACCGCAA GCGCGTCGAG
GAATTGCTGG AGGCGCGCAC GCAGTCCCTG GAGGCGCAGA TCGCCGCCCA GGCCCGCGAC
CGCGACCGCA TCTGGACCCT CTCGCCGGTG CTGAAGGTGA TCGCCTCGGC GGGCGGCGCG
ATCCAGTCGG TGAATCCCTC CTGGGTGCGC ACGCTCGGCT GGAGCGAGGC GGAGTGCCTC
GGCCGCTCCC TCCTCGACTT CGTGGTCCCG GCGGAGCGGG GCACCCTGGA GGCGGAGCTG
GCGCGGCGGG CCGCCGGGGA GGGCGGCGAC GTCGAGATCG CCTGCCTCAC CAAGTCCGGG
GAGGCGCGGC GCATCCTCTG GACCATCGTG CCGGAGGACG ACACGCTCTA CGGGTTCGGG
CGCGACGTCT CCGAGCAGCG CCGGGCCGAG GAGGCGCTGC GCCAGTCGCA GAAGCTCGAG
GCGGTGGGCC AGCTCACCGG CGGGGTGGCG CACGACTTCA ACAACCTGCT CACCATCATC
CGCTCCTCCG TCGATTTCCT GCGCCGGCCC GACCTGCCGG AGGAGCGGCG GCGGCGCTAC
CTCGACGCGG TGTCGGACAC GGTCGACCGC GCCGCCAAGC TCACTGGGCA GCTCCTCGCC
TTCGCGCGGC GCCAGGCGCT GCGGCTCGAA GTGATCGATG TCGGGGCGCG GCTGCGCAAC
GTCGGCGAGA TGCTCGACGC CGTGACGGGC GCGCGCATCC GCGTGGTCAC GGAGGTGCCG
GACCGGCCCT GCTTCGTGCG CACCGACCAG AGCCAATTCG AGACCGCCCT CGTCAACATG
GCGGTGAATG CGCGCGACGC CATGAACGGG GAGGGGACCC TGACCCTGCG CGTCAGCTGC
GGCCGCTCCC TCCCGGCCAT CCGCGGCCAC GCGGGCGCGC CCGGCCCCTT CGCGACGGTG
TCGCTGACCG ATACCGGCAC CGGCATCCCG CCCGACCTGC TCGGCCGGAT CTTCGAGCCC
TTCTTCACCA CCAAGGACGT CGGCAAGGGC ACGGGGCTCG GCCTGTCCCA GGTCTTCGGC
TTCGCCAAGC AATCCGGCGG CGACATCGCG GTCGAGAGCA CCCCGGGGAC GGGGACCACC
TTCACCCTCT ACCTGCCGCA GGTCGAGGTG CCCGACGGGG CGCTCCGGCC GGACCTCGAC
CCGCAAGGCC CCTCGCCCTC GGGGGCGGGA CGGCGGGTGC TCGTCGTCGA GGACAATGTC
GAGGTCGGGC GCTTCGCCTG CCAGATCCTG CAGGATCTCG GCTTCACGAC CGAGTGGGCC
TGCAACGCCG AGGAGGCGCT CGACCGGCTC GGCGGGGAGG CGTCGGCGTT CGACGCCGTG
TTCTCGGACG TGGTGATGCC CGGCATGGGC GGCATCGCCC TCGCCCGCGA GCTGCGCCGG
CGGCTGCCGG ACCTGCCCGT GGTGCTCGCC TCCGGCTACA GCCACGTCCT GGCCCAGGAG
GGGGCGCACG GGTTCGAGCT GCTCCACAAG CCCTATTCGG GCGAGGAACT GGGGCGGATC
CTCGACCGGG TCACGTCCCG CGACGGCGGC CGCGTCGCCT CGCGCTCCCG AGGCGGGTAG
 
Protein sequence
MAAQACGTPM AHINFIDAER QWIKAAVGHD RREMPLDLGF CAEVLRAPDV VELTDLDRDP 
ARAANPLVAE TPHLRFYAGV PLLTPEGYAI GTLCVLDRVP RELTEPQRFI LRALARTVMA
QLAMRRTDAA LRVSEERYRS LFNSIDAGFC LIEVLFGDGG KAEDFRILEV NPAFAGQTGL
ADPVGRKMRD LAPENEPYWP ELFGEVARTG RPVRVENAAR ALGRWYDVHA YRVDQPDLNR
VAVLFNDITP RRVGELALRD SEAELRLVAD AMPVLIAVID RSFTYRFVNA AYETWFGHSR
DAVIGRHMGD LIGEQEFAIR RPSVERALAG EEVRIERTWP WPDGRTRIAD IRYLPRRRPG
GEVDGVYVFV HDVTDRKRVE ELLEARTQSL EAQIAAQARD RDRIWTLSPV LKVIASAGGA
IQSVNPSWVR TLGWSEAECL GRSLLDFVVP AERGTLEAEL ARRAAGEGGD VEIACLTKSG
EARRILWTIV PEDDTLYGFG RDVSEQRRAE EALRQSQKLE AVGQLTGGVA HDFNNLLTII
RSSVDFLRRP DLPEERRRRY LDAVSDTVDR AAKLTGQLLA FARRQALRLE VIDVGARLRN
VGEMLDAVTG ARIRVVTEVP DRPCFVRTDQ SQFETALVNM AVNARDAMNG EGTLTLRVSC
GRSLPAIRGH AGAPGPFATV SLTDTGTGIP PDLLGRIFEP FFTTKDVGKG TGLGLSQVFG
FAKQSGGDIA VESTPGTGTT FTLYLPQVEV PDGALRPDLD PQGPSPSGAG RRVLVVEDNV
EVGRFACQIL QDLGFTTEWA CNAEEALDRL GGEASAFDAV FSDVVMPGMG GIALARELRR
RLPDLPVVLA SGYSHVLAQE GAHGFELLHK PYSGEELGRI LDRVTSRDGG RVASRSRGG