Gene Smed_6274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6274 
Symbol 
ID5320576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1192563 
End bp1194167 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content60% 
IMG OID640777873 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001314805 
Protein GI150378210 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.311499 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCTATG GGATAGCCGA GATGCGGTGG GCCGTATCCG ACGCTGGGTG CAGCAGTAAT 
ACATCATCGT CTCCGGTGGA TAACGGAATG CATGCAGAAA GCGTTGTTGA GAGGACGAGG
CTCGGGCAGA AGCTCGTCCG GTGGCGAGGC GACGGGATCT CGGCCTATGT TCTCGGGACT
GTAGCTGTCT TAAACATTCT TGCCATTAGA CTCGCATTTC GGGAACTCTT CGGCGACAGC
TTTCTGCTCT TGTCGCTTAC CCCTGCCGTT CTCGTGGTCG CGATGGTCGG CGGGCTGAAA
CCGATCACCT TCGCAGCAGG GCTTTCGCTT CTTGCAGCAT CATCTCTTCG TTGGATAGAG
AACCCGTTCG AGCCGACAAT CGGCGAGCTG ATCGCCTTCG GCTCGACACT GCTCCTGATC
GTGGCTCTGG GAGAAGTGCT TCAGGCGGCA AGACGCGCCA TCGACCGGAC CGAGGGTGTC
GTAAGAGCCC GCGACGCGCA TCTGAGATCC ATACTGGATA CTGTTCCGGA TGCCACAGTG
GTCAGCGCTA CCGACGGCAC AATCGTATCC TTCAACGCCG CGGCCGTCCG GCAGTTCGGA
TACGCTGAGG AGGAGGTCAT CGGCCAGAAC CTGCGCATAT TGATGCCGGA ACCCTACCGC
CACGAACACG ACGGATATCT GCAGCGCTAC ATGGCAACCG GGGAAAAGCG CATCATCGGT
ATCGATCGCG TTGTCTCGGG GCAGCGGAAG GATGGATCGA CGTTTCCGAT GAAGCTCGCC
GTGGGGGAGA TGCGCTCGGG CGGCGAGAGG TTCTTCACAG GCTTCATCAG AGACCTTACG
GAGCGGGAGG AGTCTGCCGC ACGGCTCGAG CAGATACAGG ATGAACTGGC GCGCCTTGCC
CGCCTAAACG AGATGGGCGA AATGGCTTCG ACGCTTGCCC ACGAACTGAA CCAGCCGTTG
TCGGCGATCG CCAACTATTC GCATGGCTGT ACGAGGCTGT TGCGTGACAT GGACGACGCC
GTCGCTACGC GAATGCGAGA GGCGCTCGAA GAGGTGGCGA GCCAGTCGCT GCGGGCCGGC
CAGATCATCA AACATCTGAG GGAATTCGTC ACCAACGGCG AGACGGAGAA GGCTCCGGAA
GACATTCGCA AGCTGGTCGG GGAGTCTGCG GCCCTGGCTC TGGTCGGTTC GCGCGAGCAG
GGCGTCCGCA CCGTATTCGA GTATCTGCCC GATGCCGAAA TGGTAATGGT CGACCGGATC
CAGGTGCAGC AGGTCCTCAT CAATCTGATG CGCAACGCGA TAGAGGCGAT GCGCCACGTC
GACCGCCGGG AGCTGACGAT CCGCACGATG CCGGCCGATC CGGGCGAGAT AGCAGTCGTC
GTTGAAGACT CCGGCGGAGG CATTCCGGAA GAAGTCGCCG GTCAGCTCTT CAAGCCGTTC
GTCACGACCA AGGCAAGCGG AATGGGCATC GGACTGTCCA TTTCGAAGCG GATCGTCGAG
GCGCATGGCG GTGAGATGAC TGTCTCGAAA AATGCAGCCG GCGGGGCCAC TTTCCGGTTC
ACGCTTCCCG CCTATCTAGA AGAACGGATC GTTGCAAATG ACTGA
 
Protein sequence
MVYGIAEMRW AVSDAGCSSN TSSSPVDNGM HAESVVERTR LGQKLVRWRG DGISAYVLGT 
VAVLNILAIR LAFRELFGDS FLLLSLTPAV LVVAMVGGLK PITFAAGLSL LAASSLRWIE
NPFEPTIGEL IAFGSTLLLI VALGEVLQAA RRAIDRTEGV VRARDAHLRS ILDTVPDATV
VSATDGTIVS FNAAAVRQFG YAEEEVIGQN LRILMPEPYR HEHDGYLQRY MATGEKRIIG
IDRVVSGQRK DGSTFPMKLA VGEMRSGGER FFTGFIRDLT EREESAARLE QIQDELARLA
RLNEMGEMAS TLAHELNQPL SAIANYSHGC TRLLRDMDDA VATRMREALE EVASQSLRAG
QIIKHLREFV TNGETEKAPE DIRKLVGESA ALALVGSREQ GVRTVFEYLP DAEMVMVDRI
QVQQVLINLM RNAIEAMRHV DRRELTIRTM PADPGEIAVV VEDSGGGIPE EVAGQLFKPF
VTTKASGMGI GLSISKRIVE AHGGEMTVSK NAAGGATFRF TLPAYLEERI VAND