Gene M446_4851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4851 
Symbol 
ID6131668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5330180 
End bp5331328 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content73% 
IMG OID641644987 
Productsignal transduction histidine kinase 
Protein accessionYP_001771614 
Protein GI170742959 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.141965 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0517417 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACG AGATGACCGC GACGCAGGCC GCGACCTCCC TGCCCCCCTC GCGCCTGACC 
CGGCTCGAGC GCTGGACGCA GGCGACGCGG CAGGTGCGGC GCAGCCGGCG GCTGAGCGAC
GCCTGCGGGG TGGCCGCCTT CGCGGTCGCC CTCGCGCTCC GGTTCGCCCT CGACGGCATC
CTGTCCTCCG GCTTCCCGTT CCTGACCTTC TTTCCCGCCG TCATCATCAC CACCTTCGCC
TGCGGCCCGC GGCCGGGCAC GGTCTGCGCC GTGCTGTCGG GCGTGGCGGC CTGGTTCGTA
TTCCTGACCC CGCGCTTCTC CCTGACGCTG ACCGGGGAGA GCGCCCTCGC GCTCGCCCTC
TTCGCCTTCA TCGTCGGCGT CGACATCCTG GTCATCGCCT CGATGCAGAG CGTCTCCGAG
CGGCTGCAGC GGGAGAAGGA GGTGTCCCAC CGCCTGTCGG AGCAGCAGCG CGTCCTGTTC
CAGGAACTGC AGCATCGCGT CGCCAACAAC ATGCAGTTCG TCGCCTCCCT GCTGGCGCTG
CAGCGGCGGC AGGCGGCCGG CGACCCGCAG CGGGCCCTGC TGGCGCTCGA CGAGGCGCGG
ACGCGCCTGG AGACGATCGC CCGCATCCAC CGGCGGCTCT ACGATCCCGA CAGCCTCGAC
CGGCCGGTCG GCGAGTACCT GCAGGAGATC TGCTCGGACC TGATCGCGAC GGCCGGTGCG
GGGGAGATCG TCTGCCGGGT CGAGATCGAG CCGCTGCGCC TCGACCTGAG CCGCCTGACC
ACCCTCTCGC TGCTGGTGGT CGAGGTCGTC ACGAACGCGC TCAAGCACGC CTTCCCGCCG
GGCGGGCCGG GCACGATCAC GATCCGCCTG GACCGCCTCG GGGCCGGGCG GGCCCGGCTC
ACCATCGCGG ATGACGGGCG CGGCCTGTCC GCGGGTTTCG ACCGGCAGGT CGGGTCGCGC
CTGGGCTTCC GGATCGTTCA GAGCCTCGCC GCGCAACTCG GCGGGGGAGA TCCGCTTCGC
CTCCGAGGGC GGCACCGTCG CGCGGCTCGA TTTCGGGCTG TGACCGCGGC GGCCGGCCCC
TCATCCCAGC GGCCCGGGAT GCCGGCGCAT CGGGCCGTCG ACCCTCTCGA AGGGTCGGCA
CCGCGCTGA
 
Protein sequence
MDDEMTATQA ATSLPPSRLT RLERWTQATR QVRRSRRLSD ACGVAAFAVA LALRFALDGI 
LSSGFPFLTF FPAVIITTFA CGPRPGTVCA VLSGVAAWFV FLTPRFSLTL TGESALALAL
FAFIVGVDIL VIASMQSVSE RLQREKEVSH RLSEQQRVLF QELQHRVANN MQFVASLLAL
QRRQAAGDPQ RALLALDEAR TRLETIARIH RRLYDPDSLD RPVGEYLQEI CSDLIATAGA
GEIVCRVEIE PLRLDLSRLT TLSLLVVEVV TNALKHAFPP GGPGTITIRL DRLGAGRARL
TIADDGRGLS AGFDRQVGSR LGFRIVQSLA AQLGGGDPLR LRGRHRRAAR FRAVTAAAGP
SSQRPGMPAH RAVDPLEGSA PR