Gene M446_4874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4874 
Symbol 
ID6132968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5353062 
End bp5354696 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content72% 
IMG OID641645010 
Producthistidine kinase 
Protein accessionYP_001771637 
Protein GI170742982 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0784] FOG: CheY-like receiver 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.300744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0460746 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGA CGCCCGAGCT GCCGCCGCCC GGACGCGCAC CGACCGTGAA GGACGGCCCG 
ACCGTCCAGG ACGGCGCGTC CGGCGAGGTC GCGGATCACC GCAGCGACAT CTTCTTCGCC
GCGGTCGAGA CGACCCGGAT GCCGATGATC GTCACCGACC CGCGCCAGCC CGACAACCCG
ATCATCTTCG CCAACCAGGC CTTCCGGGCC CTGACGGGCT ACGACCCGTC CGAACTGATC
GGCCGCAATT GCCGCTTCCT GCAGGGTCCC GAGACCGACC CGCAGACGAT CGCGGAGGTG
CGGCGCGCCA TCAAGGAGCG GCGCGAGATC TCGACCGAGA TCCTGAATTA CCGCAAGAAC
GGCTCCTCCT TCTGGAACGC GCTGTTCGTC TCGCCGGTCT ACAACGACGC GGGCGACCTC
GTGTACTTCT TCGGCTCGCA GCTCGACGTC TCGCGCCGCC GCGACGCCGA GGACGCCCTG
CGGCAGGCGC AGAAGATGAA GGCGCTCGGC CAGCTGACCG GGGGCATCGC GCACGATTTC
AACAACCTGC TGCAGGTGGT GGTCGGCTAC GCCGACATCA TGCAGGCGGG GCTCGCCCAT
CCGCGGCCGG ATCCCGGCCG GCTGCTGCGG GCGGCGGAGA ACATCCGCGG CGCCGCCGAC
CGGGCGACGA CGCTGACCCA GCAGCTCCTC GCCTTCGCCC GCAAGCAGCG CCTGGAGGGG
CGCACCGTCA ACCTCAACGG CCTCGTCGAG GGCATGCGCG ACCTCGCCAA CCGCACCCTC
GGCGACGCGG TCACGGTCGT CACCGAGCTG GCCCCCGAGC TGCGCAACGC CCGGCTCGAC
CCGACCCAGA CCGAGGTCGC GCTCCTCAAC GTGCTGATCA ACGCCCGCGA CGCGATGCCG
TCCGGCGGCA CGGTCACGAT CCGCACGGAG AACCGCGAGA TCGGCCCCGA CGAGATCGGC
CCCGGCCTGC CCCCGGCCGG CCGCTTCGTC GCGATCAGCA TCACGGATAC CGGCACCGGC
ATGCCGCCCG AGGTGCTCGC CCGGGTGACG GAGCCGTTCT TCACCACCAA GGAGGAGGGG
CGCGGCACCG GCCTCGGCCT GTCGATGGTC TACGGCTTCG CCAAGCAGTC GGGCGGCACC
CTCCAGATCG AGTCGCGGGT CGGCGAGGGC ACCCGGGTGC GGCTGATGTT CCCGGCCGCC
GGCGAGGACG AGCGCCCGGC GCCGGTGCGC CTGCGGGGGA CCGAGCGCCC CGGCACCGAG
ACGATCCTGA TCGTCGACGA CCGCCAGGAC GTGGCCGAAC TCGCGCGCAC CATCCTGCAG
GATTTCGGCT ACACGGTCCT GACGGCCGCC AACGGGCGCG AGGCCCTCGA CGTGCTCGAC
GGCCACCGCC GGGTCGACCT GCTGTTCTCC GACCTGATCA TGCCGGGGGG CATGAACGGC
GTGGTGCTGG CCCGGGAGGC CCGCCGCCGC CAGCCCCGGC TCAAGGTGCT GCTCACCACC
GGCTACGCGG AGGCGAGCCT GGAGCGCACC GATGTCGGGG GCAGCGAGTT CGAGGTGATC
AACAAGCCCT ATCGCCGCCT CGACCTGGTG CGCCGCGTGC GGGCCGTGCT GGACGGCCCG
AACGGGGTGA GCTGA
 
Protein sequence
MTTTPELPPP GRAPTVKDGP TVQDGASGEV ADHRSDIFFA AVETTRMPMI VTDPRQPDNP 
IIFANQAFRA LTGYDPSELI GRNCRFLQGP ETDPQTIAEV RRAIKERREI STEILNYRKN
GSSFWNALFV SPVYNDAGDL VYFFGSQLDV SRRRDAEDAL RQAQKMKALG QLTGGIAHDF
NNLLQVVVGY ADIMQAGLAH PRPDPGRLLR AAENIRGAAD RATTLTQQLL AFARKQRLEG
RTVNLNGLVE GMRDLANRTL GDAVTVVTEL APELRNARLD PTQTEVALLN VLINARDAMP
SGGTVTIRTE NREIGPDEIG PGLPPAGRFV AISITDTGTG MPPEVLARVT EPFFTTKEEG
RGTGLGLSMV YGFAKQSGGT LQIESRVGEG TRVRLMFPAA GEDERPAPVR LRGTERPGTE
TILIVDDRQD VAELARTILQ DFGYTVLTAA NGREALDVLD GHRRVDLLFS DLIMPGGMNG
VVLAREARRR QPRLKVLLTT GYAEASLERT DVGGSEFEVI NKPYRRLDLV RRVRAVLDGP
NGVS