Gene M446_5584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5584 
Symbol 
ID6133327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6123549 
End bp6125309 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content65% 
IMG OID641645707 
Productsignal transduction histidine kinase 
Protein accessionYP_001772321 
Protein GI170743666 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCCA TCGTGCCTGT TCAGCAGGCG TATGAGCCAG ATCCGCTCAG AGACCTCCTA 
AACGGCTTAC GGGACGGCTT CATCGCCCTC GATGAACGGT GGTGCTTCAC CGAGATGAAC
CCGGCCGCGG AGACGCACTT CGGCCGCGGC CGCGAGAGCG CGCTCGGCGC ACCCATTCAA
GACCTGATCC TCCCCTTCGC TGGAAGTGAG ATCGAGGCAC GCTGGCGGCA CGTGCTGGTC
TCAGGTGAGC CGGCCCTCTT CGAGGCACCC TCGGCCGTGC GACCGGACCG CATCACTGAG
TTCAACGTGT TTCGGTTCGG CGCGGGCCTG GGCGTCACAT TCCGAGACGT GACGGACGCT
CGGCAGGCCG ACGCCGCTCT TCGAGAGAGC CAGTCCCGCC TAGAAATCGC TACGGAAGCG
GCGCGGCTTG GTGTCTGGGA CTGGAACTTG CTCACCGACG AGATGGTCTA CTCGGAGCGA
GCGTGTGCCA TCCACGGTCT CTCCCCGCAC GCTCCGGTCA CCCTGGACAT GCTGCGCGGC
GCCACCCATC CCCAGGATCT GCCCCGTACC ACTGAGATGG CCGAGCGCGC CCTCGATCCG
GCCATTCGGG AGCGCGTCCC CTATGAATAT CGCATCATCA GGCCCTCTGA CGACACGGTT
CGTTGGGTGC TGGCACACGG CGAGGCCGTA TTTGCTCCTG TGGATGGCGT CGAACGAGCG
GTCCGCTATG CTGGCACGCT TCAGGATATC ACCGCCCAAC TTGAGGCCGA GGAGGCCCTG
CGCTCCAGCG AGGGCCGCCT CCGCCTCGCC CTAGACGCGG GTCGAATGGC CGTGTGGGCC
TACGACGTCG CGACCGACTC GGTCCAGGGC TCAGCGGAAC TCAACCGCAT CTACGGCTTC
CCGCCCGAGG CGTGTCCGAC GCTGGGTGAA TTCCGATCCC GCTACTATCC CGGTGACCGA
GAGAGGCTGA CCGCCGCCTG GAGCGAGGCG CGAGCGCGCG ATGACCGCTA CTTCGAGGCA
GAGCACCGAT GCGTGTGGCC GGACGGAAGC GTTCGGTGGC TGCTCCTGAG GGCCGAAACC
AAGGAAGACA GTGCTGGGCA GCCGGCCAAC ATCGTCGGTG TGGTGCTGGA CATCACAGCG
AGAAAGCGGG CCGAGGAGCA CCGAGCACTG CTCCTCCACG AACTGAACCA CCGCGTGAAG
AACACCCTCG CCACCGTGCA GGCCATCGCC CATCAGACGT TCAGGGGAGA TTCCAGCGAC
CGGACGGAGA CGTTCGAGGC GCGGCTGCTC GCCTTATCCA AGGCCCACGA CCTGCTCACG
CGTGAGAGTT GGGAAGGGGC GAACCTGACT GAGATCGTGT CGGCAGCTAT TGCGCCTTTC
CGCCGGACGG ACGGCACGCG CTTCCAGATC GTCGGCCGTC AGGTCTGGTT GGCACCGCGG
ATTGCGCTGG CGCTCGCCAT GGCGCTGCAT GAGTTGGGTA CGAATGCGGC CAAGTATGGG
GCGCTGTCCA CGATGAGCGG TCGCGTTCTG ATCGGCTGGT CTGTTTCCGG TTCGAAGCCC
ACCCACCTCA TTCTGCGCTG GTCGGAACAG GGCGGTCCCT CGGTGGTGCC CCCGACACGC
AAAGGGTTCG GTACGCGTCT GATCGAGCGC ACGTTAGCGA GCGAGATGCG GGGAGATGTG
GACATCAGCT ACGAGCCGAC TGGTGTTGAG TGTGCCTTGG GGATCGCTCT CGATGATGAC
GCGAGCAGTC CACCGGTTTA G
 
Protein sequence
MLAIVPVQQA YEPDPLRDLL NGLRDGFIAL DERWCFTEMN PAAETHFGRG RESALGAPIQ 
DLILPFAGSE IEARWRHVLV SGEPALFEAP SAVRPDRITE FNVFRFGAGL GVTFRDVTDA
RQADAALRES QSRLEIATEA ARLGVWDWNL LTDEMVYSER ACAIHGLSPH APVTLDMLRG
ATHPQDLPRT TEMAERALDP AIRERVPYEY RIIRPSDDTV RWVLAHGEAV FAPVDGVERA
VRYAGTLQDI TAQLEAEEAL RSSEGRLRLA LDAGRMAVWA YDVATDSVQG SAELNRIYGF
PPEACPTLGE FRSRYYPGDR ERLTAAWSEA RARDDRYFEA EHRCVWPDGS VRWLLLRAET
KEDSAGQPAN IVGVVLDITA RKRAEEHRAL LLHELNHRVK NTLATVQAIA HQTFRGDSSD
RTETFEARLL ALSKAHDLLT RESWEGANLT EIVSAAIAPF RRTDGTRFQI VGRQVWLAPR
IALALAMALH ELGTNAAKYG ALSTMSGRVL IGWSVSGSKP THLILRWSEQ GGPSVVPPTR
KGFGTRLIER TLASEMRGDV DISYEPTGVE CALGIALDDD ASSPPV