Gene Mchl_1350 details


Gene Information

Locus tag: Mchl_1350
Symbol:
ID: 7116323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 1387885
End bp: 1390935
Gene Length: 3051 bp
Protein Length: 1016 aa
Translation table: 11
GC content: 70%
IMG OID: 643524125
Product: signal transduction histidine kinase
Protein accession: YP_002420160
Protein GI: 218529344
COG category: [T] Signal transduction mechanisms
COG ID: [COG2203] FOG: GAF domain; [COG3920] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
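
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below is a consistency check rather than part of the original record. It assumes the coordinates are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Mchl_1350.
START_BP = 1387885
END_BP = 1390935

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the 3051 bp and 1016 aa listed in the record.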


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.597121
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGAAC ACTGTGTCGA GCCCCAGCCT GTCCGAACAA TGAATCCCGC CCCGATGAGC 
GAGCAAGAGG CTGCGCGCCT TCGTGCCTTG GACCGCTATC GGTTGCTCGA CACCCCGCGC
GAGCAGGATT TCGACGAGAT CGCCGAGGCC GCCGCCGAGC TGTGCGAGGC GCCGATCGCG
GTGATCAATC TCGTCGGCGA CGGGCGGCAG TTCTTCAAGG CGGAGGTCGG CCTCGGTGTG
CGCGAGACGC CGCTCGAAAC CTCCTTCTGT CGGCAGGCGA TCCTGCACGA CGACTTCCTC
TACGTGCCCG ACACCGCGCG CGACCCGCGC TTCGAAGGCA ACCCGCTCGT CAGCGGCGAT
CCCGGCCTGC GCTTCTACGC CGGCGCCCTG CTGAGGACCG ACGAGGGGCA GCCGATCGGG
ACTGTCTGCG TCCTCGATAC CCGCCCGCGC GAACTCTCGG AGCGGCAGCG CCTCGGCCTG
ATGCGGCTCG CCCGCCAGGC CATGACGCAG ATGGAACTGC GCCGCTCGCT GCGTGAGCAG
GCGGAGCAGC GCCTGCTGCA CGAGCGCATC CTCGACAGCG CTACTGATTA CGCGATCGTG
GCCATGGACC CCCAAGGCCG CGTCACGCGC TGGAACACCG GCGCCGAGCG TATCCTCGGC
TGGACCGAGG CCGAGATGCT GGGCCGGACG GTCGATGCGT TCTTCACGCC GGAGGATCGG
GCGGGCGACC GGCCCGATGT CGAGAAGCGC CTGGCGGCGC AGACCGGCAG CGCCCCGGAC
GAGCGCTGGC ACATGCGCAA GGACGGAACC CGTTTCTGGG CCTCGGGCGA GATGATGCCG
CTGACGGCCG AAGACGGCGG GCTCATCGGC TTCCTCAAGA TCCTGCGCGA CCGTACCGGG
CAGCGCGCCT CGGAGGCGGC TTTGCAGGCG AGCGAGTTGC GCTACCGCTC CCTCGTCGAG
GTCAGCCCGC AGGTCGTCTG GTTCGGCGAC GAGGCCGGCC GTGTCACCTA CTGCAACACC
TATTGGTACG ACTATACGGG GCTGCCCCCC GGCGAGACCG GCGAGGCGAG CTGGATGGGT
GTGATCCATC CCGATCACCG CGAGCGCGTT CGCGATGCGT GGCTTGCCGC GGCGCGAAGC
CGGGGGGGCT ACGAGGTCGA GTTCCCCCTT CGCCGCGCCG ACGGGCAGTA CCGCTGGTTC
CTGTCGCGGG CGCGGCCCGT GCGCGACGAG GCCGGGCACC TCAGGAGCTG GATCGGCACC
ACCCTCGACA TCCACGAGCG CAAGGTGGCT GAGGAGCGCT TCGCGGCGCT TACCGAACTG
GCCCCGGCCA TCATCTGGTT CGGGAATCCG GACGGCAGCC TCAGCTACCT CAACGACCGC
TGGTACGCCT ATACCGGCCA GACCCCCGAG CAGGCGCTGC CGCTCGGCTG GGGCGAGGCG
ATCCACCCGG ACGACGTTGA CGGCCTGCTC AAGGTCTGGG AGGCGGCCCG CACCCACGAG
ACCGTCTACG ACACCGAGGC ACGCCTGCGG GCGCGCGACG GAACCTACCG CTGGTTCCTG
ATCCGTGCCG AGCCGCGCCG GGACGCGAGT GGCGCGGTGG TCGGCTGGCT CGGCAGCAAC
AGCGACATCC ACGACCGTCG GCAGGCGGAC GAGGATCTGC GCCGGGCGCG GGAGCAGTTG
CATCTCGCCG TCGAGGCGAC CGGAACCGGC ATCTTCGACT ACGACCTCGT CACGGACACG
CTGGAATGGG ACGCGCGCAC CCGCGCGCTG TTCGGCCTGG GACCGGAGGC GCCGGTCGCC
TACTACGTGT TCCTGGCCGG CCTGCATCCG GAGGACCGGG CCTGGGTCGA TCGGGCGGTC
GAGGCCGCGC TCGATCCGGC CGGCAGCGGC ACCTACGACA TTGCCTACCG GACCATCGGC
CTGGAGGATG GCATCGAGCG CTGGGTCGCC GCCAAGGGAC AGGCCTTCGT CGCCGGCGGC
CGCACCGTGC GCTTCATCGG CACCGTGCGC GACGTCACGC AGAGCCGGCG GGCCGAGCAG
ACCTTGCGTG AGACCGAGGA GCGTTACCGC CTCGCGGCGC GTGCCACCAA CGATGCGATC
TGGGACTGGA ACCTCGCGAC CAACCAAGTC CTCTGGAACG AGGCCCTCAC GGTCGCCTAC
GGTTATCCGC CGGAGGCGGT CGATCCGACC GGCGATTGGT GGATCACCCA TATCCATCCC
GACGACCGGG CGCGGATCGA CGCCTCCATC CACGCGGTCA TCGACGGGAC CGGCACCGCC
TGGAGCGACG AGTACCGTTT CCTGCGCGCG AACGGCACCT ATGCCGACAT CCTCGACCGG
GGCTACGTCA TCCGTGACGG GCACGGGGCG GCGGTGCGGA TGATCGGGGC GATGCTCGAC
ATCAGCGAGC GCAAGCGGGC CGAGGAGCAC CAGCGCCTGC TCACCGGCGA GTTGCAGCAC
CGGGTCAAGA ACACGCTCAC CCTCGTTCAG GCGATCGCCA GCCAGACCCT CCGCAACGCC
CCGGATCTGG ATGCAGCCCG CGAGGCTTTT GCCGCGCGCC TGATCTCGCT CGGCCGCGCG
CACGACATCC TGACCCGGTC GAGCTGGACC GAGGCGCCGA TCGCGGAAGT CGTGGAGGGG
GCTCTGGCGG TCCATCGCGG CGCTGCCATG GCGCGCATCC GCGCGAGCGG ACCGAGCGTG
CTGCTCGGCG CCAAGGCGGC CCTCTCGCTC GCGCTGGCCC TGCACGAGCT GGCCACCAAC
GCGACCAAGT ACGGCGCCCT CGCCAACGAG ACGGGATGCG TCGAGTTGCG CTGGCACGTG
GTGCACGAGG ACGAGGCACC CCGCTTCTGC CTGACATGGT CCGAGCAGGG CGGTCCGCCC
ATCCTGAGCC AGCCCTCGCG CCGCGGCTTC GGCTCGCGCC TGATCGAGCG CAGTTTCGCC
GCCGAGGTCG GCGGAGAGGT CAAGCTCACC TACGCGCCGA CCGGCCTCGT CTGCCGCCTG
GAAGCCCCCC TCGCATCGAT GCAGGAGCCG CGCGACGAGG TCGCCGCCTG A
 
Protein sequence
MVEHCVEPQP VRTMNPAPMS EQEAARLRAL DRYRLLDTPR EQDFDEIAEA AAELCEAPIA 
VINLVGDGRQ FFKAEVGLGV RETPLETSFC RQAILHDDFL YVPDTARDPR FEGNPLVSGD
PGLRFYAGAL LRTDEGQPIG TVCVLDTRPR ELSERQRLGL MRLARQAMTQ MELRRSLREQ
AEQRLLHERI LDSATDYAIV AMDPQGRVTR WNTGAERILG WTEAEMLGRT VDAFFTPEDR
AGDRPDVEKR LAAQTGSAPD ERWHMRKDGT RFWASGEMMP LTAEDGGLIG FLKILRDRTG
QRASEAALQA SELRYRSLVE VSPQVVWFGD EAGRVTYCNT YWYDYTGLPP GETGEASWMG
VIHPDHRERV RDAWLAAARS RGGYEVEFPL RRADGQYRWF LSRARPVRDE AGHLRSWIGT
TLDIHERKVA EERFAALTEL APAIIWFGNP DGSLSYLNDR WYAYTGQTPE QALPLGWGEA
IHPDDVDGLL KVWEAARTHE TVYDTEARLR ARDGTYRWFL IRAEPRRDAS GAVVGWLGSN
SDIHDRRQAD EDLRRAREQL HLAVEATGTG IFDYDLVTDT LEWDARTRAL FGLGPEAPVA
YYVFLAGLHP EDRAWVDRAV EAALDPAGSG TYDIAYRTIG LEDGIERWVA AKGQAFVAGG
RTVRFIGTVR DVTQSRRAEQ TLRETEERYR LAARATNDAI WDWNLATNQV LWNEALTVAY
GYPPEAVDPT GDWWITHIHP DDRARIDASI HAVIDGTGTA WSDEYRFLRA NGTYADILDR
GYVIRDGHGA AVRMIGAMLD ISERKRAEEH QRLLTGELQH RVKNTLTLVQ AIASQTLRNA
PDLDAAREAF AARLISLGRA HDILTRSSWT EAPIAEVVEG ALAVHRGAAM ARIRASGPSV
LLGAKAALSL ALALHELATN ATKYGALANE TGCVELRWHV VHEDEAPRFC LTWSEQGGPP
ILSQPSRRGF GSRLIERSFA AEVGGEVKLT YAPTGLVCRL EAPLASMQEP RDEVAA