Gene M446_3558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3558 
Symbol 
ID6134424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3972125 
End bp3975079 
Gene Length2955 bp 
Protein Length984 aa 
Translation table11 
GC content75% 
IMG OID641643725 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001770373 
Protein GI170741718 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.299477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGG AGACCCCATC CCCGAGCCGC CGGCTGCCGC GCTTCGCGCG CCCGCGGTCG 
CGCGCCGCCC TCCTGGCCGT GCTGCTCCTC GGCGTCAGCT TCGCGGCGGC CCTCCTGAAC
TACCTATCGG GCACGCGCGC CGCGGCCGAG GCGGTGCGGC TCAACGGCGT GATCGCCCAG
AGCGAGCGCG TGCTCTCGAC CCTGAAGGAC TTGGAGACCG GCCTGCGCGG CTACATCCTC
ACCGGCGACG GCGCCTATCT CGACCCCTAC GCGGCGGCGC TGACGCACAT CGACCCGGAG
CTCGACCGGC TCGACGGCTT CGACCGCGAC CCGGACACCC GCGCCCGGCG CCGCCGCCTG
ATCGCGGACC AGCGCGCCTA CGCGGCCCGC GCGGTCGCGG CGCGCCGGGA GCAGGGCCCC
GAGGCCGCCT CCGCCCTGGT GCGGAGCGGG GAGGGCAAGC GCATCATGGA TGCGCTGCGG
ACCCTCACGG CGGCGGTGCA GGACCGGGCG GTCGCGGATC TCGACCGCAT CGCGGCGCGC
GAGCGCCTGC GCTCGCCGCT GCTGCAGGTC CTGGTGCTGG CCTCGGCGCT GGCGGCGGCG
GGCCTGCTCG CGCGCCTCGC CATCCTGCGC CGCCGCGAGG GCCGCCGCAC CGCGGCGCTC
CTCGCCGGGG TGCTGGAGAA CGCGCCCGTC GGGCTCGGCT TCCTCGACCG CGACCTCACC
ATCCGGCACA TGAACCGGGC GCTGGAGAGC ATGAGCGAGC GCGGCCTCGG CACCGGGGTC
GGAGAGCCGA TCTGGGCGCT GCTGCCCTCC CTGCGCGACG CGCTCGCGCC CAAGCTCGCG
GCGGCGCGCG ACGAGGGCCT GGTCACGGCG AATGTCGAGG TCGGGGTGCC GACCCCGAGC
GCGCCGGGCA GCGTGCGCTA CTTCCAGATG AGCTTCTTCC CCCTGCGCCG CGACGTCGAC
GACCGCGCCG CGCGGGCGGA GGGCGTCGGG CTCGTGATGT CGGACGTGAC CCTGCGCAAG
CTCTCCGAGG CGCGCACGCG GGAGAGCGAG GAGCGCTTCC GCTCGCTCAC CGAGGCGACC
TCGGCCATCG TCTGGCGCAC CACCCCGGAG GGCACCTTCG CGCAGGTCAC CTCGGAATGG
ACGCGCTTCA CCGGCCAGAC CCCCGAGGAG GCGGCGGGGC TCGGCTTCCT CGACGCCGTC
CATCCCGAGG ACCGCGCCGC CACCCGCGCG GCCTGGGAGC AGGCGGTCGC GACCCACGCG
CTCTACGCGA TCGAGCACCG GCTGCGCCGC CACGACGGCA TCTACCGCCA CATGGAGGTG
CGCGCGGTCC CGATCCTGGA GAAGGACGGC CGGGTGCGCG AGTGGGTCGG GGCGCATGCC
GACACCACGG CGCGCAAGGA GGCCGAGTTG GCGCTCGAGG CCGCCCGCGA GGCGGCCGAG
GAGGCGAACG CCGCCAAGAG CCAGTTCCTG GCCAACATGA GCCACGAGCT GCGCACGCCC
CTCTCGGCGG TGATCGGCTA CGCGGAGATG CTGCAGGAGG AGATGGAGGA TCTCGGCGCG
TCGGCGCTGC TGCCCGACAT GCGCAAGATC GAGGCGAATG CCCGCCACCT GCTCGGGCTG
ATCAACGACG TGCTCGACAT CTCGAAGATC GAGGCCGAGC GGATGGAGGT CTATGCCGAG
GATTTCGACG TCGCCGCGAC CCTGCAGGAT GTCGGCGCCA CGGTGGGGTC GCTGATCGCC
AAGAAGGACA ACGCGCTGGT GCTGGACCTC GCGGAGGGGC TCGGGCGGGC GCATACCGAC
GTCACGAAGC TGCGCCAGTG CCTGATCAAC CTCCTCAGCA ACGCCGCGAA GTTCACCGAG
GGCGGCCGGA TCGTGCTCTC GGCCGAGCGC CTGCGGCGGG ACGGCCGCGA CCGGCTGCGC
TTCCGGGTCG CCGACACGGG CATCGGCATG AGCGCGGAGC AGCAGGCGCG GCTGTTCGAG
CGCTTCACCC AGGCGGATGC CTCGACGACC CGCCGCTTCG GCGGCACCGG GCTCGGCCTC
GCCATCACCC GCGCCTTCGT GGAGATGCTG GGCGGCGCGA TCGCGGTCGA GAGCCGCGCA
GGGGAGGGCA CGACCTTCAC GATCGAGCTG CCGGTCCGCT ACCGGGCGGA GGCGGAGGCC
GGGGAGGACG AGGCCGATGC CGCGGCGCCC GCGCCCGCCG CGGAGGCGGC GCGGGAGGCC
GGCGGGGACC TCGTCCTCGT CATCGACGAC GACCCGGCGA CCCGCGACCT CCTCGCCCGC
TTCCTGCGGC GGGACGGGTT CCGGGTCGCC GCCGCGCCGG ACGGGCGCGC CGGGCTGGAC
CAGGCCCGGG CGCTGCGCCC CCGGGTGATC CTGCTCGACG TCACCATGCC GCGCATGGAT
GGCTGGGAGG TGCTGCGCGC CCTGCGGGCG GACCCGGACC TCGCCGCGAC GCCCGTGATC
ATGGTGACGG TGCTCGACGA GCAGAACCTC GCCTTCTCGC TCGGCGCGAC CGACTACCTG
CACAAGCCCG TGGCCTGGAA GCAGCTGAAG GAGGCGATGG AGCGCTTCCG GCCGGCGATC
CACGAGGGGC CGGTCCTGGT CGTGGACGAC GACCCCGACG TGCGCGAGCG CATCACCGCG
CTCCTCACCC GCGAGGGCTG GCGCGCCGCC TCGGCGGCGA ACGGCCGGGC CGGGCTCGAC
GCGGTGGCGG TGCGCAAGCC GGGGCTGATC CTGCTCGATC TGATGATGCC CGAACTCGAC
GGGTTCGGCT TCCTGCGCGG CCTGCGGGCG AGGCCGGAAT GGCGGGACAT CCCCGTCGTG
GTGCTGACCG CCAAGGACGT GACCGCCGAC GAGCGGCGGC GCCTCGCCGG CCAGGCCGAC
CGCGTCCTGC AGAAGGGCGG CCTGAGCATG GCCGACCTCG CCGCCACCGT CCGCTCCCTG
CTCGTGCCGA GCTGA
 
Protein sequence
MTEETPSPSR RLPRFARPRS RAALLAVLLL GVSFAAALLN YLSGTRAAAE AVRLNGVIAQ 
SERVLSTLKD LETGLRGYIL TGDGAYLDPY AAALTHIDPE LDRLDGFDRD PDTRARRRRL
IADQRAYAAR AVAARREQGP EAASALVRSG EGKRIMDALR TLTAAVQDRA VADLDRIAAR
ERLRSPLLQV LVLASALAAA GLLARLAILR RREGRRTAAL LAGVLENAPV GLGFLDRDLT
IRHMNRALES MSERGLGTGV GEPIWALLPS LRDALAPKLA AARDEGLVTA NVEVGVPTPS
APGSVRYFQM SFFPLRRDVD DRAARAEGVG LVMSDVTLRK LSEARTRESE ERFRSLTEAT
SAIVWRTTPE GTFAQVTSEW TRFTGQTPEE AAGLGFLDAV HPEDRAATRA AWEQAVATHA
LYAIEHRLRR HDGIYRHMEV RAVPILEKDG RVREWVGAHA DTTARKEAEL ALEAAREAAE
EANAAKSQFL ANMSHELRTP LSAVIGYAEM LQEEMEDLGA SALLPDMRKI EANARHLLGL
INDVLDISKI EAERMEVYAE DFDVAATLQD VGATVGSLIA KKDNALVLDL AEGLGRAHTD
VTKLRQCLIN LLSNAAKFTE GGRIVLSAER LRRDGRDRLR FRVADTGIGM SAEQQARLFE
RFTQADASTT RRFGGTGLGL AITRAFVEML GGAIAVESRA GEGTTFTIEL PVRYRAEAEA
GEDEADAAAP APAAEAAREA GGDLVLVIDD DPATRDLLAR FLRRDGFRVA AAPDGRAGLD
QARALRPRVI LLDVTMPRMD GWEVLRALRA DPDLAATPVI MVTVLDEQNL AFSLGATDYL
HKPVAWKQLK EAMERFRPAI HEGPVLVVDD DPDVRERITA LLTREGWRAA SAANGRAGLD
AVAVRKPGLI LLDLMMPELD GFGFLRGLRA RPEWRDIPVV VLTAKDVTAD ERRRLAGQAD
RVLQKGGLSM ADLAATVRSL LVPS