Gene Smed_5952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5952 
Symbol 
ID5320254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp914865 
End bp915932 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content64% 
IMG OID640777639 
Productsignal transduction histidine kinase 
Protein accessionYP_001314571 
Protein GI150377976 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
[COG3920] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.113414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTTGG GCAGGGTGCT CTACATTGAC GACGATCCAG CCCTTCGGCG TCTGGTCGGT 
AAGGAGTTCG AGCGGCACGG CTACATTGTC CAGCTTGCTG CCACAGGCGA CGAGGGGCTT
CGCCACTTGC GCGCGGGCGG CATCGATGTT GTGGCACTCG ACCACTACAT GCCCGGCCAG
GATGGGCTCC AGACACTCGC TTCCATCCGC GCCGATCCTG ATCCGCCGCC GGTGGTCTAT
GTGACCGGCT CGGAGGAGGG TCGGGTTGCG ATCGCAGCAC TGAAGGCAGG CGCGACAGAC
TATGTCCTCA AGGATGTTGG TGGCGAGTTT CTCGCGCTGT TGCGGGTGGC GATTGAAGGC
GCCCTTGCTC AGGCGGACCT GCGGCGCGAA AAGGAGGAGG CGGAAGCCGA AGTGCGAGCC
GCGCGTGACC GGTTCGAGGC GCTCGCGGCT GAAAGAGCCG TGCTGCTTCG CGAAGTGAAC
CACCGGGTCG GTAACAGCCT GCAGCTCGTC TCTACCTTCC TCCTGATGCA AAGCGACATG
AGCGATGAGC CCCATGTAAA GGCGGCGCTC GCCTCTGCCT ATGGCCGTGT GCTTGCCATC
GCCCAGGTGC ACAAGCGCCT CTATACGTCA GACGATGTGC GCACAGTCGC GCTCGACAAC
TATCTCCATG CGCTCGTCGT AGACATTGGC GCCTCCGCGG CCGGCGCGAA CGGGTGGCTC
TCTCTCGCGG CCGATCCGGT CGCGATAGAC CCAGACCGTG CCGTGGCAGT CGGCGTTATT
GTCACCGAAT TGATCATCAA TGCGATGAAA CACGCATACC CCCGCGGGGA GGGGCCCGTT
CGTGTTGCGC TACATGCGCC GGCAGGAAAT AGCGTTTGCC TGTGCGTGGA GGATGATGGA
CTGGGGAGCC GATTGCCGCC CGCAGAAGGC TCGACGGGGC TTGGGCAGCT CATCATCGAG
GCCATGGCGG TGAAGCTCGG AGCGGTTGTG ACCGTCCACG CACGCGACCC CGGGACGCGG
GTCGTTGTCG ATTTCATGAA AGCCGAGGCC AAACGACTGG TGGATTAG
 
Protein sequence
MALGRVLYID DDPALRRLVG KEFERHGYIV QLAATGDEGL RHLRAGGIDV VALDHYMPGQ 
DGLQTLASIR ADPDPPPVVY VTGSEEGRVA IAALKAGATD YVLKDVGGEF LALLRVAIEG
ALAQADLRRE KEEAEAEVRA ARDRFEALAA ERAVLLREVN HRVGNSLQLV STFLLMQSDM
SDEPHVKAAL ASAYGRVLAI AQVHKRLYTS DDVRTVALDN YLHALVVDIG ASAAGANGWL
SLAADPVAID PDRAVAVGVI VTELIINAMK HAYPRGEGPV RVALHAPAGN SVCLCVEDDG
LGSRLPPAEG STGLGQLIIE AMAVKLGAVV TVHARDPGTR VVVDFMKAEA KRLVD