Gene M446_1489 details


Gene Information

Locus tag: M446_1489
Symbol:
ID: 6134486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1645789
End bp: 1648113
Gene Length: 2325 bp
Protein Length: 774 aa
Translation table: 11
GC content: 61%
IMG OID: 641641759
Product: histidine kinase
Protein accession: YP_001768428
Protein GI: 170739773
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
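
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `expected_lengths` is a hypothetical helper, not part of any database API.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive on both ends.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(expected_lengths(1645789, 1648113))  # (2325, 774)
```

Under these assumptions the result agrees with the listed Gene Length (2325 bp) and Protein Length (774 aa).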


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAGG AAACATTTCG AATCAGCTCC CACTTGAAGG ACATCATCGG CCGTGACCTC 
GTGACGAACG AGTTCGTGGC ACTATTCGAA TTGGTAAAAA ACTCGTTCGA TGCAGGCGCA
ACCTCCGTTG ACATCGAGTT CGATCCGAAC AAGCGGTCGA TCGCTGTAGT TGACAACGGG
CGCGGCATGT CTGAGTCGGA TGTGCGCGAT AAATGGCTGT TCGTAGCCTA CTCCGAGAAG
GCACTCGTCG GTCGTAATGA CTATCGCAAC AAAATTCGCC CCGCAGGCCA ATTTGCCGGG
AGCAAGGGAA TAGGCCGCTT TGCTTGCGAT ACACTTGGCA GAAAATTGGA CCTGTACAGC
CGCGTGCAGG GTAGCAGCGC AATTTCTAAG CTCGAAATCG ATTGGCGCGA CTTCGAAGGC
GAGAGCACCA ACGAGTTTCA GGAAGTAAGC GTATCACTTG GATGTTCGCA GTCGTTTCCG
CCTCTGATGA ATCCCTCCCC GCCCGAGAAC AGCGGAACGG TGCTCTTGAT CAAGGAAACA
CGGCAGGATT GGGACGAGGA CAGCATACGT CGCCTCCGGC GGGACCTCGC CAAGCTAATC
GATCCCTTCG GGACGACGAG CGAGGTGACG TTGTCGACAT GGTTCGCCGA CGGCTCCGGG
GAGGAAATCG AGGGCGTCGA CGGGCCCGTG GGCAACGAGA TCGCGGAACT CCTGCGGGAC
AAGACCAGCC GGATCGAGGT TGTGATCGTC GACGGCTTTA TTGACACTAC GCTCTACGAT
CGCGGCCGGA AAATCTACGC GATCCGCGAG CCTTCCCTCT ATCCGGAACT CGCGGCATGC
CGGATAGAGG GGCAGGTGTT CTTCCTCAAT CGGTCGGCCA AGCACACATT CACGCTCCGC
ATGGGAGTAA GGCCTATCGA GTTCGGGAGT GTCTTCCTCT TCTTGAATGG CTTCCGCATC
CTTCCCATTG GTGAGGAGTT CGACGACACG TTTGGCTTGA ACCGACGGAA GCAACAGGGA
CAGGCCCGTT ATCTGGGCAC CCGCGACATC ATCGGCCGTG TAGACGTGAC CGCCCCGCCC
AAGATGTTTC GCGAGGTTTC GAGCCGGGAC GCTGGGCTCG TGGACGACGC GAACCGCCGT
GCGCTGTTTG AGGCTATCCG CCGGCACATG GTGTTCAAGC TCGAACGCTA CGTCGTGGGT
GTGAACTGGG CGGACAAGTC CGACCAAAAT CGCGACACGC CGGAAGGGCT GGAGACTGAC
CACGCCCGTG AGCGCATTCT CACGATCGTG GGAAGTCTGG CGCGGACGAG AGACATCGAG
ATCCTCTATT TTGACGAAGA TCTAGTCAGG GTATCCGAGG ATCCCGATCA GGTCACCGAT
AACGCGCTCA GGGCGATGTC GGACGTCGCG GAGAGTCGCG GTGATGCGAA GCTGCTGGAG
CAGGTCGAAG CGGCCCGGCG CAGGATCGCC GAGCTCAGGG CGCAACGGCA GGAAGCGCGC
GAGGTCGCGC AGCGCGCGAT TCAGGAACGG AACCGGGCTG ACGCCCGCAT CGCACGATTG
GAACAGCAGG CGGCGTTCCT CGGCAGCAGC CGGGATCTGG ACATTGAGCG AGTCCAACTC
CTGATGCACC AAGCCACCAT CCACGCGGGA CACGTTCGCT CCGCGGTGGC GAACGCTGCC
TACGAGATCA GGAACGTGCT GTCGCTGGCA GCGACGCAGG GCGAGCTGGA CGATCCGGAA
GAGATCGAGG ACTTGCTCGC GTCGATCCGG CAGTCGGCCC GCCGGGTCTC CGCCTCCATT
GCGGGTGCCA CACTCTCGGG CGACCGCCTG GGTGCAGTGC TCTCCTTCGC GCCCAACATT
CGGATCGACC TAAAAACTGA CAAAGTGGAG GGCGACCTGC TTCAGTTCCT CGCCGAGTAC
TTCACAGTCC GGCTGGCTGG CGTCCCCGGC ATGCCCGAGG CTGTCTTCCG AAATCCCGGA
TTGTCGCTGT TCCGCGAGTT CTCGCCCGTC GACATTGCCG TTCTCATCGA CAACCTGCAG
GACAACGCCC GCAAGCACCG AGCGTCCCGA ATCGAGTTCG TGGCTGCGAG GAAGGGTGCC
AATAGAGTGG TGATCAGGGT GTCTGACGAC GGGGTCGGTA TCAACGTGGA CCGGATTGAT
CCCGCGAAAA TCTTCGAGCG CGGCTACACG GGTTCGTCTA ATGGGACCGG GCTAGGCCTC
TACAGCGCCC GCCAAATTCT GCAGGAGATG GGCGGCTCGA TTGATCTGAT GGGGGACGGC
AGCCGCGCCG ACTTCGAGAT CGTAATACCC GGGGAAGGGG AATGA
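
A GC content figure like the one in the gene information can be recomputed from a sequence laid out in space-separated blocks of ten, as above. This is a minimal sketch: `gc_content` is a hypothetical helper, and it assumes the sequence contains only the unambiguous bases A, C, G and T once whitespace is removed.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    # Remove the spaces and line breaks used to lay out the sequence.
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Illustration on the first two blocks of the gene sequence only.
print(gc_content("ATGGCGCAGG AAACATTTCG"))  # 50.0
```

Passing the full gene sequence to the same function gives the value for the whole gene.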
 
Protein sequence
MAQETFRISS HLKDIIGRDL VTNEFVALFE LVKNSFDAGA TSVDIEFDPN KRSIAVVDNG 
RGMSESDVRD KWLFVAYSEK ALVGRNDYRN KIRPAGQFAG SKGIGRFACD TLGRKLDLYS
RVQGSSAISK LEIDWRDFEG ESTNEFQEVS VSLGCSQSFP PLMNPSPPEN SGTVLLIKET
RQDWDEDSIR RLRRDLAKLI DPFGTTSEVT LSTWFADGSG EEIEGVDGPV GNEIAELLRD
KTSRIEVVIV DGFIDTTLYD RGRKIYAIRE PSLYPELAAC RIEGQVFFLN RSAKHTFTLR
MGVRPIEFGS VFLFLNGFRI LPIGEEFDDT FGLNRRKQQG QARYLGTRDI IGRVDVTAPP
KMFREVSSRD AGLVDDANRR ALFEAIRRHM VFKLERYVVG VNWADKSDQN RDTPEGLETD
HARERILTIV GSLARTRDIE ILYFDEDLVR VSEDPDQVTD NALRAMSDVA ESRGDAKLLE
QVEAARRRIA ELRAQRQEAR EVAQRAIQER NRADARIARL EQQAAFLGSS RDLDIERVQL
LMHQATIHAG HVRSAVANAA YEIRNVLSLA ATQGELDDPE EIEDLLASIR QSARRVSASI
AGATLSGDRL GAVLSFAPNI RIDLKTDKVE GDLLQFLAEY FTVRLAGVPG MPEAVFRNPG
LSLFREFSPV DIAVLIDNLQ DNARKHRASR IEFVAARKGA NRVVIRVSDD GVGINVDRID
PAKIFERGYT GSSNGTGLGL YSARQILQEM GGSIDLMGDG SRADFEIVIP GEGE
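
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the method used to generate this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. As a simplification, it does not handle alternative start codons such as GTG or TTG, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop layout whitespace
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        # A codon with an ambiguous base (for example N) raises KeyError.
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGGCGCAGG AAACATTTCG AATCAGCTCC CACTTGAAGG ACATCATCGG CCGTGACCTC"
print(translate(first_line))  # MAQETFRISSHLKDIIGRDL
```

The output matches the first twenty residues of the protein sequence listed above.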