Gene Mext_4081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4081 
Symbol 
ID5832962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4539344 
End bp4542031 
Gene Length2688 bp 
Protein Length895 aa 
Translation table11 
GC content70% 
IMG OID641369872 
Producthistidine kinase dimerisation/phosphoacceptor 
Protein accessionYP_001641522 
Protein GI163853479 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR02373] photoactive yellow protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.628066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.498394 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGAAGG CAGACCCGGC ATTGCTTAAC CGGGGCGTGG CAGGCATACC GGCTGCCGTA 
ATGGTCGATC CGTCGCAGAT GCATGCCGTG GGGCCGAACG GTGGTCCGGT CGACCTTCCG
GTCGATCTCG ATGCCTTGAC CGCCGAGCAG CGGGATGAAT TCGGCGTGGG CATTCTTGCC
CTCGACGCCG CCGGAATCGT GCTCGCCTGC AATCGGGCCG CGGGCGCGCT GTGCGGCCTG
CCGCCCAATA CGATGATCGG GCGGAGTTTC TTCCGCGAAC TCGTGCCGAG TGCCAACGTG
CCGAGCTTCT ACGGCCGCTT TCTCAGCGGC CAGCGCCGGA GCGTGGCCGA TCAGGCCTTC
GAGTTCGTGT TCGGCCGCAT CCCGGCCCCC CTGCGGGCAC GAATCGGCCT GCGGTCCGGG
GCGAACGGGC ACATCTGGCT GACGATCACC CCCCTGGAGC AGATCGCCGC CGGCCCCTCG
CGCGAGGCCG TCCTCGCGGC GATCGCTCAG CGCAGCCGGG CCGAGCCGGT CGATCCGAGC
CTGTGCGAGC GTGAGCCGAT CCACATTCCC GGCTCGATCC AGCCGAACGC GGTGATGCTC
GCCGCCGACG CCGCGAGCCT CGAGATCCTG GCCTTCAGCG CCAACGCCGC CGACGTGCTG
GCGCCCGATC TCTTCCCGCC CAACGGCCTC AACCTGGAGG CGGTGCTGCC GGGCGCGATC
GTCTCCGCGA TTCGCGATGG GCTCGCCGCC CACACCCTGA CCGACGGTCG GCTCCTGCGC
CGCTCGCTCA CGCTTCCGCC GCGGGGAGAG CGCTTCCACC TCGTGGCCCA TGCCCATCTC
GGCCGGGTGA TCGTCGAGCT GGAACTGGCG CCGGAGCGCC CCGAGGACTT CCTCGCCGCG
AGCCCGCTCG ACGCCGAACT CGCCATGATG CGGCTGCGCG CCGCCGAGTC CCTGACCGAG
GCGGCGCAGA TCGCCGCCCT CGAAATCCGC GCCATGACCG GTTTCGAATC GGTGCTGGTC
TACCGGTTCG ACACGGATTG GAACGGCGAG GCCATCGCCG AGGACATGGT GCCGGACTGG
CAGCGCCCGC TGATCGGCCT GCGCTTTCCC GCCTCCGACA TCCCGGCCCA GGCCCGCGCC
CTCTACACCA AGGCCAAGAG CCGCTTCGTG ATCGACCGCG ACTGCGTGCC CGTGCCGCTG
GTGGCCGACC GCGCCGCGGG CAACGCGCCG GTCGACCTCA CTTTCGCGCA GAACCGAACG
CTCTCGCCGA TCCACCTCGA ATACCAGCGC AATCTCGGCG TCGACGGCTC GATGTCGATC
TCCATCATGG TCGAGAACCG GCTCTGGGGC CTGATGATTG GCCATCACCG CCGGCCGCAC
TACGTCGCGC CGGAGACCCG CGCCGCGGCG ACCGTCCTCA CCGACGCCTT CGCCATGCGG
GTGCAGGAGA TCGAGGGCAA GGCGCTCTGG GGCGAGCGGC AGCGCCATCT CGACGTGCAG
GGCCGGCTGG TGCGCGGGCT CACCCGTTCC GACGATTTCG TCACCTCCCT GACCCAGGGC
GATCCGACCC TGCTCGACCT GTTCGGCGCC ACGGGTGCGG GCATCGTCTC GGACGAGGCC
GTCTGCCTCG TCGGCGTCAC GCCGGAGGCG GCGAAGGTGC GGGCGCTCGC CGATTGGCTG
CGAGAGAGCC TGCCACCCGA CGAGACCACC TTCGTCACCG ACACGCTGGT GCTGCATCAC
GCGCCGGCCG CGGACTTCAC GGAGATCGCC AGCGGCCTGC TCGCGGCCTT CGTCGGCACC
TCGCGCCAGC ATCTGCTGTT CTGGGTCAAG CCGGAGGTGC CGAGCACGGT GACCTGGGGC
GGCGATCCGC GCAAACCCGT CCTGCCCGGC AGCGGCCCGG TGGCGGTGCT GCCGCGGCGC
TCGTTCGAGC GCTGGATCGA GGAGCGCCGT GGCCATTCCA CCCCCTGGGC GACGTGGAAG
GTGGCGCTCG CCGCGCAGCT CGCCGACGCC GTGGACGGCG TGGTGCTGCG CCAGCGCCGC
AAGATCGACG AGCTGACGGG GCTGCTGGCC GACAAGGAGC GCCTGCTGGA GCAGAAGGAT
CTGCTCACCC GCGAGATCGA CCACCGGGTC AAGAACTCGC TGCAGATCGT GACGGCCTTC
CTGCACATGC AGCGCCGGCA GATCGCCGAC CCGGAGGCGC GCCAAGCCTT CTCCGAGACC
TCGGCCCGCG TCATGAGCGT CGCGCGGGTG CATGACAGCC TGTACCAGGG CGAGAGCATG
GAGCAGGTCG ATCTCGGCCA GACCATCCAG ACCCTGTGCA GCGACCTCGC CGGTATGGCC
GGCGACGAGC ACAGCGTCGA GCTGACCGCC GAGCCCGGCC TGATGGTGCC CTATCGGCAC
GCGGTGGCGC TCTCGCTGAT CACCACCGAA CTCGTCACCA ACGCGTTCAA ATATGCCGGC
AAGCCCGAGA AGGGGGCGCG GATCAGCGTC TCCGTGGCCG GCGGCGAAGG GGCGGCCGTC
CGCCTCAGGA TCTGCGACGA CGGCGAGGGC ATGCCGACGG GCTGGAAGAA CGCCAAGGCG
CGGGGCACCG GGCTCGGCAT GAAGCTGATC CGCGCCATGC TCGACCAGAT CGGCGCCCGC
CTCGACGTCG AGAACGCCGA CGGCGCCTGC TTCACCGTTC ACGCCTGA
 
Protein sequence
MAKADPALLN RGVAGIPAAV MVDPSQMHAV GPNGGPVDLP VDLDALTAEQ RDEFGVGILA 
LDAAGIVLAC NRAAGALCGL PPNTMIGRSF FRELVPSANV PSFYGRFLSG QRRSVADQAF
EFVFGRIPAP LRARIGLRSG ANGHIWLTIT PLEQIAAGPS REAVLAAIAQ RSRAEPVDPS
LCEREPIHIP GSIQPNAVML AADAASLEIL AFSANAADVL APDLFPPNGL NLEAVLPGAI
VSAIRDGLAA HTLTDGRLLR RSLTLPPRGE RFHLVAHAHL GRVIVELELA PERPEDFLAA
SPLDAELAMM RLRAAESLTE AAQIAALEIR AMTGFESVLV YRFDTDWNGE AIAEDMVPDW
QRPLIGLRFP ASDIPAQARA LYTKAKSRFV IDRDCVPVPL VADRAAGNAP VDLTFAQNRT
LSPIHLEYQR NLGVDGSMSI SIMVENRLWG LMIGHHRRPH YVAPETRAAA TVLTDAFAMR
VQEIEGKALW GERQRHLDVQ GRLVRGLTRS DDFVTSLTQG DPTLLDLFGA TGAGIVSDEA
VCLVGVTPEA AKVRALADWL RESLPPDETT FVTDTLVLHH APAADFTEIA SGLLAAFVGT
SRQHLLFWVK PEVPSTVTWG GDPRKPVLPG SGPVAVLPRR SFERWIEERR GHSTPWATWK
VALAAQLADA VDGVVLRQRR KIDELTGLLA DKERLLEQKD LLTREIDHRV KNSLQIVTAF
LHMQRRQIAD PEARQAFSET SARVMSVARV HDSLYQGESM EQVDLGQTIQ TLCSDLAGMA
GDEHSVELTA EPGLMVPYRH AVALSLITTE LVTNAFKYAG KPEKGARISV SVAGGEGAAV
RLRICDDGEG MPTGWKNAKA RGTGLGMKLI RAMLDQIGAR LDVENADGAC FTVHA