Gene Smed_5742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5742 
Symbol 
ID5320044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp708934 
End bp710130 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content59% 
IMG OID640777455 
Productsignal transduction histidine kinase 
Protein accessionYP_001314387 
Protein GI150377792 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCGG CGAAGGTGTC CAAGCCCGAC GTTGCGGCGG TCTACACGGT CGACCTGAGT 
CAGCGGCCAG TGCCACCCAG CGATGCCGCG CAGGAGAAGG CCGCACTTCT TCAGCTCGCC
GGACGCATGC ACGATGGCCC TGGAGAGGTG CTCCCAAAGT TCGTCGAACT CGCCATGGAA
CTGACAGGCG CGATTTCCGC AGGTGTCAGC CTGCTCGAAC AGACAGAACC GTTCCCGGTG
TTCCGCTGGC ATAATCTGAG GGGAATTCTC TCGCCGTTCA ACGGGGCGAC TACTCCCCGC
GATTACTCCC CCTGTGGTAT CACGCTCGAC CGCGGCGCAC CGACGCTGGC GATCCATCCA
GAACGCGTCT ATGACTGGAT ACCATCGGGC TTGTCGCTGC CAGAGGTTTT GCTTGTGCCG
CTATATATTG GGCGGACGGA ACCACTCGGG ACCCTTTGGG TCGTCGCCGA CAGGATCGGG
TATTTCCACT GCGGTCACGG AGCCACTCTG CAGGAGCTTG CCGATTTCAT CGGCGTAGCA
CTGAAGATGG CCCGATCCGA ACAGGAACTC CTGCAGGCGC TTGAACAGCA GGAATTGCTC
ACCAGAGAGA TGAGCCATCG GCTCAAAAAT CTCTTCACCA TCGTAGACGG CATGGTCCGC
ATCAGCGCCC GCAGCACAAA TAGTAAGGAT GATTTGGTCT CGCTATTGTC TGGCCGGTTG
CATGCGCTCG CCGCTGCGCA TTCCTTGGTG AAGCCATCGT TTAGCGACGT CCAAGGGGCG
GCCTCCAATC TTGCGGACCT GATAAGCATC GTCATCGAGC CGCACAAGCC TGCGGCAATT
TTCGGCAGGA GCCGGTTCTC TCTTGAAGGC CCTGCAATTC TTTGCGGCGA GCAATCGGTC
AACGGACTCG CGCTGGTCTT TCATGAACTG ACAACCAATG CGGCTAAATA CGGGGCGCTG
CGCGGCGACA ATGGCCGGGT GGATATTGTC TGGCAAGTCA ATGGCGACGA CCTGAGCATC
ACTTGGAGTG AAGACGGGGG AGGAGAAATA ACCTCCCCCC CTGTATCCAA GGGCTTCGGC
AGCACGTTGG TGGACGCGAC TGTGACCCGC CAATTCGGTG GTATCCTCAG CTATGACTGG
CGTAAGACCG GCTTATCAGT GAACATAGTC CTGCCATTGT CCCGTCTGGC ACAATAG
 
Protein sequence
MEAAKVSKPD VAAVYTVDLS QRPVPPSDAA QEKAALLQLA GRMHDGPGEV LPKFVELAME 
LTGAISAGVS LLEQTEPFPV FRWHNLRGIL SPFNGATTPR DYSPCGITLD RGAPTLAIHP
ERVYDWIPSG LSLPEVLLVP LYIGRTEPLG TLWVVADRIG YFHCGHGATL QELADFIGVA
LKMARSEQEL LQALEQQELL TREMSHRLKN LFTIVDGMVR ISARSTNSKD DLVSLLSGRL
HALAAAHSLV KPSFSDVQGA ASNLADLISI VIEPHKPAAI FGRSRFSLEG PAILCGEQSV
NGLALVFHEL TTNAAKYGAL RGDNGRVDIV WQVNGDDLSI TWSEDGGGEI TSPPVSKGFG
STLVDATVTR QFGGILSYDW RKTGLSVNIV LPLSRLAQ