Gene Mext_3139 details

Gene Information

Locus tag: Mext_3139
Symbol:
ID: 5835543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3490656
End bp: 3492257
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 69%
IMG OID: 641368940
Product: histidine kinase
Protein accession: YP_001640598
Protein GI: 163852555
COG category: [T] Signal transduction mechanisms
COG ID: [COG0784] FOG: CheY-like receiver; [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
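
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself; it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 3490656
end_bp = 3492257
gene_length_bp = 1602
protein_length_aa = 533

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last codon is assumed to be the stop codon.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span == gene_length_bp, remainder == 0, residues == protein_length_aa)
# prints: True True True
```

Under these assumptions the record is internally consistent: 1602 bases give 534 codons, of which 533 encode amino acids.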


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.468681
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.330779
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACA TCCCCGAGAA GAGACCCCAC GGCGTCCATC CCAGCGTCGA GGCGGGTGCG 
AAGGGCCACG AATTCGAGTC CGGTAGCGGC ATCTTCTTCG CCGCCGTCGA GATGACACGC
ATGCCGATGG TCGTGGTCGA TCCGAACCAG GACGACCACC CGATCGTGTT CGTGAACCAA
GCCTTCCTGG AAATGACCGG CTACGCGCGG GACGAGGTGA TCGGCCACAA CTGCCGCTTC
CTCCAGGGCC CCGAGACCGA TCCGGCGACG CGGGCGCTGG TGCGCGACGC CGTCGCGGCG
CGGCGCGACG TGGCCACCGA GATCCTCAAC TACCGCAAGG ACGGCTCATC GTTCTGGAAC
GCCCTGTTCG TGAGCCCGGT CTATAACGCC GCCGGGGACC TCCTGTACTT CTTCGGCTCG
CAGCTCGACA TCACCCGTCG CCGACTGGCG GAAGACTCGC TGCATCAGGC GCAGAAGATG
GAGGCGATCG GCCGGCTCAC CGGCGGCATC GCCCACGACT TCAACAACCT GCTCCAAGTG
ATCCTGGGCT ATGCCGACTC GCTGGCGACC AACTTGGACC GGCCGGACGC CGACCGGGGC
CGCATGGGCC GCGCCGTCGG CAACATCCGC GAGGCGGCCG AGCGGGCCTC GACGCTGACG
CAGCAACTGC TCTCGTTTGC CCGCAAGCAG CGCCTCGACG GCCGCACCCT CAACCTCAAC
GACCTCGTCT CCGAGACGAA GGAGCTGGCC GGGCGCACGC TGGGCGACGC GGTGACGATC
GAGACAGATC TCGCCCCCGA CCTCTGGCCG TGCCGGATCG ACCGGACCCA GGCCGAGGTC
GCCCTGCTCA ACGTGCTCAT CAACGCCCGC GACGCGATGC CGGAGGGCGG GCGCGTCACG
ATCACGACCC GCAACGAGGA ATCCGGCCGG CCCGACGGTT CGGGCCGGCA CGTGAGCGTC
GCGGTGACCG ATACGGGCGC GGGCATCCCC TCGGACATGC TGGCGCGGGT GATGGACCCG
TTCTTCACCA CCAAGGAGGA GGGGAAGGGC ACCGGGCTCG GCCTGTCGAT GGTCTACGGC
TTCGCCAAGC AGTCCGGCGG CTTCGCGCAG ATTGAGTCGG TGATGGGGGA GGGGACGACG
GTGCGTCTCT CCTTCCCCGC GAGCGACGAG GCCGGTGCGC CGGAGAGCGC AGAACCGGCC
GCCATCGTCG AGGAGCGGCC GGGCACGGAA ACGATCCTGA TCGTGGACGA TCGCGCCGAC
GTCGCCGAAC TCGCCCGGGC GATCCTGCGC GATTACGGCT ACGGCACCCT GATGGCCCGC
CACGGCCGCG AGGCCCTGGA GATCCTGAAC GACCATCCGG AGATCGACCT GCTGTTCTCC
GACCTGATCA TGCCCGGCGG CATGGACGGC CTGACGCTCG CCCGCGAGGC GCGCCGCCGC
CATCCCGATC TGAAGATCCT GCTCACCACC GGCTATGCCG AGGCCAGCCT GGAGCGAACC
GGAATCGAGC GTCCGGAGTT CGATATCCTG AACAAGCCGT ACCGCCGTGC CGAGCTGATC
CGGCGGGTGC GGGCGGCCAT CGATGCGCCG AACCGGAGCT GA
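
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be flattened before any computation. As a hedged illustration of how the GC content field could be recomputed from this block, the sketch below strips whitespace and counts G and C bases. The function name `gc_fraction` is hypothetical, and the sample input is only the first row of the sequence, not the full gene.

```python
def gc_fraction(seq_text):
    """Return the fraction of G/C bases in a sequence, ignoring whitespace."""
    # Remove the spaces and line breaks used for display.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence only; paste the whole block to check the full gene.
first_row = "ATGACCGACA TCCCCGAGAA GAGACCCCAC GGCGTCCATC CCAGCGTCGA GGCGGGTGCG"
print(f"{gc_fraction(first_row):.1%}")
```

Passing the complete sequence block to the same function should give a value comparable to the GC content listed in the record, assuming that field was computed over the gene alone.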
 
Protein sequence
MTDIPEKRPH GVHPSVEAGA KGHEFESGSG IFFAAVEMTR MPMVVVDPNQ DDHPIVFVNQ 
AFLEMTGYAR DEVIGHNCRF LQGPETDPAT RALVRDAVAA RRDVATEILN YRKDGSSFWN
ALFVSPVYNA AGDLLYFFGS QLDITRRRLA EDSLHQAQKM EAIGRLTGGI AHDFNNLLQV
ILGYADSLAT NLDRPDADRG RMGRAVGNIR EAAERASTLT QQLLSFARKQ RLDGRTLNLN
DLVSETKELA GRTLGDAVTI ETDLAPDLWP CRIDRTQAEV ALLNVLINAR DAMPEGGRVT
ITTRNEESGR PDGSGRHVSV AVTDTGAGIP SDMLARVMDP FFTTKEEGKG TGLGLSMVYG
FAKQSGGFAQ IESVMGEGTT VRLSFPASDE AGAPESAEPA AIVEERPGTE TILIVDDRAD
VAELARAILR DYGYGTLMAR HGREALEILN DHPEIDLLFS DLIMPGGMDG LTLAREARRR
HPDLKILLTT GYAEASLERT GIERPEFDIL NKPYRRAELI RRVRAAIDAP NRS
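
The relationship between the gene sequence and the protein sequence can be illustrated with a small translator. This is a sketch under stated assumptions rather than the method used to produce the record: it uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not handle), and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the display spacing and line breaks.
    dna = "".join(dna_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGACCGACA TCCCCGAGAA GAGACCCCAC"))
# prints: MTDIPEKRPH
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (AAC CGG AGC TGA) translate to NRS followed by a stop, matching the end of the protein.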