Gene Smed_2344 details

Gene Information

Locus tag: Smed_2344
Symbol:
ID: 5323205
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2420305
End bp: 2421807
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 63%
IMG OID: 640791282
Product: signal transduction histidine kinase
Protein accession: YP_001328011
Protein GI: 150397544
COG category: [T] Signal transduction mechanisms
COG ID: [COG2203] FOG: GAF domain; [COG3920] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
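
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon, which is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(2420305, 2421807)
print(gene_len, protein_len)  # 1503 500
```

Under those assumptions the result agrees with the listed Gene Length (1503 bp) and Protein Length (500 aa).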


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00211814
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATGAAA AGACAGAGGC GATGCTCCCA GCCGTAGCGC AGGTGCTCGA TGCGCCGGAA 
CGGCTCGGCG TTCTTCAGGC CGCGGTGCCG GACATGTCGA TCCCAGACGG GGATTTCGAC
GGGCTTGCGG GGCTTGCGGC CAGCCTTTTC GATGCGCCGG TCGCCCTCAT CACCCTCGTG
GACCACGAAT GGCAGTGGTT CAAGGCTTCC GTCGGCACGG CGGAAACGCG GCTGCACGTT
CGTGAGTCGT TCTGCGTCCA TACGATCGCA GAAGGCGACG GCGCCTTTGT CGTGCTCGAT
GCCTCTCGCC ACCCGGCGTT CAGACAGCGT CGGGCGGTCG CCGGCCCGCC CTCCGTGCGC
TTCTATGCGG GCGCCCCGAT CATTCTGGAC GGGCAGGCGA TAGGAACAGT CGCGGTCGTC
GACGTCGCTC CCCGTTTCGA GATATCCGTC AAGCAGCAAG GCGAGCTGCA GCGCATCGCA
GGCGTCGCCG CCTCGCTGTT CAAACTGAAG GACGAAACAC GCCGGCGTGC GCTCAAGGAG
GCCGCGCTTT CCCGCGAAGA GCAGCGGCTT GCCATGGCGC TCGATGCGGC CAATGTCGGC
AGCTGGCTCT GGGACATTCG GGCCGGCACG GTTTCGGGCA ACGGCGCAAT GATGCGCATG
TTCGGCCTTC CGCCAGAACG CACCGTTGGC GCCAAGGCCA TATTTTCCGC CATCCATCCC
GACGATCGCA TGCCGACCTT CTCGAAACTT CGCCAGGCGA TGGCTGCCAA TGAGGAATAT
GACGGCATGT TTCGCATCGG CACCAATGGA AGGTGGCTGC TTGGCCGCGG CCGTGTGCAC
GACCGCGACA GCAAGGGTGC ACCCTTGAGC TTTCTCGGCA TGACGATCGA TGTTTCGGAG
CAGCAGGCGT CGGTGAACCG CACGCGGCTG CTGCTGAAGG AACTGAACCA CCGGGTCAAG
AACACGCTGG CAATGCTCCA GTCACTCGCC CGCCAGACGC TTCGCCAGAC GAGCGACCCG
GCCGAATTCA TGACCGCCTT CGCCGGCAGG CTTCAGGCGA TCTCCGAGGC GCATGGGCTC
CTTTCCGATT ACGAATGGGG CACGATCCAC CTGTCCGAAC TGATTTCGAA ACAGTTGCTG
CCCTATGTCA GCGATTACTC CCAACAGATC GAATTGCACA AGGATGAGAT CCTGCTTGGT
CCGGACCAGG CCGTGGGGCT CGGGCTGGTT CTGCACGAAC TGGCGACCAA TGCCGTAAAA
TACGGCGCTC TCTCGCTGCC GACGGGAAAA ATCGTGCTGA CGGCCCGCCG CTTAATCGAG
GACGGAGAAT CCGTGTTGCA TCTGACCTGG ACTGAAGTGG GCGGCCCCCC GATTCGCGAG
CCGCGCCGCC GCGGTTTCGG ATCTATCCTG ATCGAACGCA GCCTCGACAA GATCATCGGC
AGCTCGGTCA AGGTAGAATA TCTGCCGGCG GGAGTCACCG CGTTGATCCG GCTGCCGCTT
TGA
 
Protein sequence
MHEKTEAMLP AVAQVLDAPE RLGVLQAAVP DMSIPDGDFD GLAGLAASLF DAPVALITLV 
DHEWQWFKAS VGTAETRLHV RESFCVHTIA EGDGAFVVLD ASRHPAFRQR RAVAGPPSVR
FYAGAPIILD GQAIGTVAVV DVAPRFEISV KQQGELQRIA GVAASLFKLK DETRRRALKE
AALSREEQRL AMALDAANVG SWLWDIRAGT VSGNGAMMRM FGLPPERTVG AKAIFSAIHP
DDRMPTFSKL RQAMAANEEY DGMFRIGTNG RWLLGRGRVH DRDSKGAPLS FLGMTIDVSE
QQASVNRTRL LLKELNHRVK NTLAMLQSLA RQTLRQTSDP AEFMTAFAGR LQAISEAHGL
LSDYEWGTIH LSELISKQLL PYVSDYSQQI ELHKDEILLG PDQAVGLGLV LHELATNAVK
YGALSLPTGK IVLTARRLIE DGESVLHLTW TEVGGPPIRE PRRRGFGSIL IERSLDKIIG
SSVKVEYLPA GVTALIRLPL
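
Both sequences are displayed in space-separated blocks of 10 residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind behind the GC content field; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def join_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGCATGAAA AGACAGAGGC GATGCTCCCA GCCGTAGCGC AGGTGCTCGA TGCGCCGGAA"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq) * 100), "% GC in the first 60 bases")
```

Applying the same two helpers to the full gene sequence would give a whole-gene value to compare with the reported GC content.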