Gene M446_2939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_2939 
Symbol 
ID6131204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3254003 
End bp3255346 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content72% 
IMG OID641643130 
Producthistidine kinase 
Protein accessionYP_001769785 
Protein GI170741130 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.012617 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGCCG CACCCGCCGA GGAGGTGGCC GCGCTGCACC GCCGCATCGC CAAGCTGGAG 
CGCATCAACG CGGCCCTGAT GTCCCAGGTC GAGCGCTCGA TGGACCAGCA GGGCACCGCC
TACTCGCTGT TCCAGACCGC CATCACCCTC GAGGGGCAGG TGCGCGCCAG GACCGAGGAG
CTGACGGCGC TGATGCGCAG CCTCGAACGC TCGAACCAGG CGCTGATCGC CGCCAAGGAG
GAGGCGGAGC GGGCCAACCG CTCGAAGACC CGCTTCCTGA CCGCGGCGAG CCACGACCTG
CTCCAGCCCC TCAACGCGGC GCGGCTGTCG CTCTCGGCCC TCGGCGACAT GCCGGTGGGG
GCGGAGGCGC TCGCGATCGT CGGCCAGGTC GAGCGCGGGC TCCAGACGAT CGAGGACCTG
ATCAAGACGC TCCTCGACAT CTCCAAGCTC GATGCCGGGC TGATCCAGCC GCAGCCGCGC
AGCCTGAAGC TCGCGGACGT GTTCGCGAGC GTCGAGGCGA GCTTCGGCCC GCTCGCGGCC
CGCAAGGGGC TGCGGCTCAG CGTGCGCCCG GGCGACCTCT GGGTGTCGAG CGATCTCGTG
CTGCTGCAGC GGATCATCCA GAACCTGGTC TCGAACGGGA TCCGCTACAC CCGCACGGGT
GGCGTCCTGG TCGCGGCCCG GCCGCGCGGC GGGGACGTGC GCGTCGACGT GATCGATTCG
GGCGCGGGCA TCCCGGAGGC GGACCGGGAG CTGATCTTCG AGGAGTTCCA CCGGGGCGGG
CGCGAGAGCG TCGACGGGGA GATCGCGCTC GGACTCGGCC TCTCCATCGT GCGCCGGTCC
GCCGAGGCGC TCGGCCACCG GCTGAGCCTC GTCTCCCGGG TCGGCCACGG GTCCCGCTTC
ACCCTGATCC TGCCGCGCGC CCAGGCCGAG GCGCCCCGGC CGGCCCAGCC CCTCGCCCTG
CCGACGAGCC TCACGGGGGC GCGCATCGCC GTGATCGAGA ACGACCGGGC GGCGCTGGAG
GCGCTCGCCC GGCTGTTCCA GAACTGGGAC GCGCACGCGC TGGCGGCCCG CGACCATCTC
AGCCTGATCC GGCTCACCGG CTCCACCTGG AGCCCGGACG TGATCATCGC GGATTACCAC
CTCGACGGCG GAGCCTGCGG CCTCGACACG GTGACGTGGC TTCGCACCGT GCATGGCGGC
GACATCCCGG CCGTCGTGAC GACCGCCGAC CACTCGCCGG AGGTCGAGGC CCATGTCCGT
GCGGCAGGCT GCGAGCTGAT CCACAAGCCG ATCAAGCCCG GTCAGCTGCG CGCGCTGCTG
GCGCATCTCC TGAGCAAATC TTGA
 
Protein sequence
MEAAPAEEVA ALHRRIAKLE RINAALMSQV ERSMDQQGTA YSLFQTAITL EGQVRARTEE 
LTALMRSLER SNQALIAAKE EAERANRSKT RFLTAASHDL LQPLNAARLS LSALGDMPVG
AEALAIVGQV ERGLQTIEDL IKTLLDISKL DAGLIQPQPR SLKLADVFAS VEASFGPLAA
RKGLRLSVRP GDLWVSSDLV LLQRIIQNLV SNGIRYTRTG GVLVAARPRG GDVRVDVIDS
GAGIPEADRE LIFEEFHRGG RESVDGEIAL GLGLSIVRRS AEALGHRLSL VSRVGHGSRF
TLILPRAQAE APRPAQPLAL PTSLTGARIA VIENDRAALE ALARLFQNWD AHALAARDHL
SLIRLTGSTW SPDVIIADYH LDGGACGLDT VTWLRTVHGG DIPAVVTTAD HSPEVEAHVR
AAGCELIHKP IKPGQLRALL AHLLSKS