Gene M446_1526 details

Gene Information

Locus tag: M446_1526
Symbol:
ID: 6131990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1700054
End bp: 1702981
Gene Length: 2928 bp
Protein Length: 975 aa
Translation table: 11
GC content: 69%
IMG OID: 641641794
Product: multi-sensor hybrid histidine kinase
Protein accession: YP_001768463
Protein GI: 170739808
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
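
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1700054, 1702981)
    print(length, protein_length(length))
```

With the values listed in this record, the sketch reproduces the stated gene length (2928 bp) and protein length (975 aa).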


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCAAG CGCCGATCGG AACGCCGCAC GAGCGATCTC TCGGAACGGG CACCGCGCCC 
CGCGGCGAGG GGTTCCTCGA TGCCGCCGGG GTCGCCAAGC AGAAGCTGAT CGAGGCGATC
GAGAGCAGTT CCGAAGGCTT CGCGCTGTTC GATCCCGACG ACCGCCTCGT CCTCTGCAAC
GATCACTTCC GGGATTTCCA CCCGGGTCTC GCGGAGGTGA TCGTTCCCGG CGCGTCTTTC
CAGACGATCG CCCGCGCCGC CGCTGAATCC TGCATCGTGC ACAAGGAAGG CACGACCGTC
GAGGCGTGGC TCGCCGAGCG GATGGCTCAT TATCGAGAGC CTCGCGGATC CATCCTCCAG
CAACGCGTCG ACGGGCGCTG GGTGCAGGTC AGCGAGCGCA AGACCAACGA CGGCGGCGTC
GTCGCCGTCT ACACGGATGT GACCGAGATC AAACGCGCCG AGCAGGTCGT GCTCACGACA
CAGGGCAGGC TGACCTACCT CTTGACCGCC TCCCCGTCGA TGATCTGCAG CTTCGAGGTG
GGCGGTCGAA ACGCGCCCAC GTTCATCAGC GAGAACGTGC GCGACCTCCT GGGCTACGAG
CCGAGCGATT ACATGGCCGG GCCGGAATTC TGGCTGGACC GCCTCCATCC CGAGGACCGG
GACCGGGTCC TGTCCGAATT CCCACGCCTG CTCCAGCAAG GGCACAACGT CATCGAGTAC
CGGTTCCTGC GCGCCGACGG CACCTACCGG TGGGTGAGGG ACGAGCAGCG CCTCCTGCGG
GATCCTCATG GAGAACCGGT CGAGGTCGTC GAGTCATGGA GCGACATCAC CGAGCACAAG
GAGGCCGAAC TCGCGCTGCA GAGGCAGACC GCTTTCGTCG AGCTGCTGCA GGCCGCGGCA
ACGGCGGCCA ACGAGGCGTC CACGGTCGAG GACGCCGTGC GGTTCTGCCT GGACCGCGTC
TCGCAACATG CCGGCTGGCC GATCGGACAC GCCTACGTCG TGGCCGAGGA CGGAAGCCGC
ACGCTTGTCC CGACCGGCAT CTGGCACTGC GACGCGCCGG AGCGGTTCGC GCGGTTCCGC
GCCGCGGCGG CCGGGATGCG GATCGGCGCC GGCACGGGAC CTGCCGGCCG CGCGCTCCTC
TCGGGAGCAC CGGAGTGGAT CCTCGGCGAT CCGTCCTCGC CCGACGATCC GCGCGCCGCC
GCGGCGGCGG CGGATGGGCT TCGGGCTGGC TTCGGGTTTC CCGTGACGGT GGGGCGCGAG
GTCGTGGCAG TCTTGGAGTT CTTCGCCTGC GATGCGCCTC CTCCGGACCC GGCCCTGCTC
AGGGTGATGA CCAATATCGG TGCGCAACTG GGGCGAGTGA TCGAGCGCAA GCGCGCCGAG
GAGAGCCTGC GGCAGGCCAA GGAAGCGGCG GAGGATGCGA GCCGCGCCAA GAGCAGCTTT
CTCGCCAATA TGAGTCACGA GCTGCGCACT CCGCTCAACG CGATCATCGG CTTCACCCGG
CTGGTGATGC GCCGGGCCAA GGAGGCGTTG CCGACGAAGC AGTTCGAGAA CCTCGAGAAG
ATCCTGGCGA GTTCCGAGCA CCTGCTCTCG CTGATCAACA GCATCCTCGA TCTCGCCAAG
GTCGAGGCCG GGCGCATGGA GGTGAAGGCG TCCGAGTTCG CCCTGGAGCC CGTGCTCGAC
CTCTGCCTCA GGACGGTCGA ACCCCTGATC AAGAGCGAGG GCGTGCGTCT CGTCCGGGAC
GTCGAGGATC CGCCCACGAT GCTCCGGACG GATGAGGAGA AGCTCCGGCA GATCCTGATC
AACCTCCTCA GCAACGCGAT CAAGTTCACG GAAGCCGGAT CGGTGACGCT GCGGGTCCGC
TCGGCCGGCG AGCACGTCGA ATTCGCGGTC ACGGACACGG GTATCGGCAT CCCCCAGGAG
GCGCTGAGCG CGATCTTCGA CGAGTTCCAT CAGGTCGACA ACAGCGCGAC CCGGTCCCAC
AGCGGGACCG GGCTCGGTCT CGCGATCAGC CACCGGCTGG CCCGGCTGCT CGGCGGCCAC
ATCGACGTCG AGAGCCGGGT CGGGCAGGGC TCAACCTTCA CGCTCAGCAT CCCGCCGCGG
ATCGCCGGCG CGCCCGAGAT GGCACCGAGC CTGCCGGAGC CGGCCCCCGC CGCGGCGGCC
GTGCCCCGGT CGGGGTCCAA GCTCGTGCTC GCGATCGATG ACGACCCGAA CGTGGTCTAC
CTGCTGCAGG AGAACCTCGC CGATGCGGGC TACACGGTGA CCGGCGCTTC GAGCGGCCAG
GACGGGCTGC GGATGGCGCG GGAGCTGCAG CCGCGGGCGA TCACGCTGGA CATCATGATG
CCTGGCACGG ACGGCTGGCA GGTGCTGCAC GCGCTCAAGA CCGATCCGCA GACCCGCGAC
ATCCCCGTCG TCCTGATCTC GATCGTCGAT CAGAAGGAAC TCGGCTTCCG GCTCGGGGCG
ACGGACTACA TCGTGAAGCC CTTCGAGCGG GAGGCCCTGA TCGGCGTGCT GGCGCGCATC
GCCCCCGGCA ACGAGCGCGT CCTGGTGATC GACGACGACC CCAACGTGCC CGACCTCGTC
CGCCAGTTGC TGGATTCCGA GCACTGCACC GTGGACTGGG CGGCGGACGG CGCCGCCGGC
CTTGAGCGCA TCGCGCAGGC CCGCCCGAGC GTGATCCTCC TCGACCTGCT CATGCCACGG
ATGGACGGAC TCACCTTCCT CGACGCGCTC CAGGCGGATC CGGTCGCTCG GACCATTCCC
GTCGTCGTGC TGACGGCCGC GTCCCTGGAC TCGGTGGAGC GCGGCCTGCT GCGAGAGCGC
GTGCTCGGCC TGATCGACAA GCAGGGCCTC GACCGCGCGG CCCTCATCCG CGAGGTTCAG
CGCGCATTGC CGCTTCCGGA GCCCGCCGCC GTGGAAGGCA GCCGATGA
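
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any computation. The sketch below shows one way to do that and to compute the GC fraction, which can then be compared with the GC content field of the record. The helper names are hypothetical, and the short input in the usage example is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence, used as a small example.
    fragment = clean_sequence("ATGTCCCAAG CGCCGATCGG")
    print(len(fragment), round(gc_fraction(fragment), 2))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` should give a value close to the 69% reported in the record, assuming that figure refers to this gene rather than to the whole replicon.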
 
Protein sequence
MSQAPIGTPH ERSLGTGTAP RGEGFLDAAG VAKQKLIEAI ESSSEGFALF DPDDRLVLCN 
DHFRDFHPGL AEVIVPGASF QTIARAAAES CIVHKEGTTV EAWLAERMAH YREPRGSILQ
QRVDGRWVQV SERKTNDGGV VAVYTDVTEI KRAEQVVLTT QGRLTYLLTA SPSMICSFEV
GGRNAPTFIS ENVRDLLGYE PSDYMAGPEF WLDRLHPEDR DRVLSEFPRL LQQGHNVIEY
RFLRADGTYR WVRDEQRLLR DPHGEPVEVV ESWSDITEHK EAELALQRQT AFVELLQAAA
TAANEASTVE DAVRFCLDRV SQHAGWPIGH AYVVAEDGSR TLVPTGIWHC DAPERFARFR
AAAAGMRIGA GTGPAGRALL SGAPEWILGD PSSPDDPRAA AAAADGLRAG FGFPVTVGRE
VVAVLEFFAC DAPPPDPALL RVMTNIGAQL GRVIERKRAE ESLRQAKEAA EDASRAKSSF
LANMSHELRT PLNAIIGFTR LVMRRAKEAL PTKQFENLEK ILASSEHLLS LINSILDLAK
VEAGRMEVKA SEFALEPVLD LCLRTVEPLI KSEGVRLVRD VEDPPTMLRT DEEKLRQILI
NLLSNAIKFT EAGSVTLRVR SAGEHVEFAV TDTGIGIPQE ALSAIFDEFH QVDNSATRSH
SGTGLGLAIS HRLARLLGGH IDVESRVGQG STFTLSIPPR IAGAPEMAPS LPEPAPAAAA
VPRSGSKLVL AIDDDPNVVY LLQENLADAG YTVTGASSGQ DGLRMARELQ PRAITLDIMM
PGTDGWQVLH ALKTDPQTRD IPVVLISIVD QKELGFRLGA TDYIVKPFER EALIGVLARI
APGNERVLVI DDDPNVPDLV RQLLDSEHCT VDWAADGAAG LERIAQARPS VILLDLLMPR
MDGLTFLDAL QADPVARTIP VVVLTAASLD SVERGLLRER VLGLIDKQGL DRAALIREVQ
RALPLPEPAA VEGSR
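
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration with hypothetical function names. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which this sketch does not model). Codons that cannot be looked up are reported as `X`.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(text: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace in the input (block separators, line breaks) is ignored.
    """
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence.
    print(translate("ATGTCCCAAG CGCCGATCGG AACGCCGCAC"))
```

Applied to the first three blocks of the gene sequence, this yields `MSQAPIGTPH`, the first ten residues of the protein sequence listed above.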