Gene Smed_5895 details


Gene Information

Locus tag: Smed_5895
Symbol:
ID: 5320197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 860073
End bp: 861143
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 58%
IMG OID: 640777590
Product: signal transduction histidine kinase
Protein accession: YP_001314522
Protein GI: 150377927
COG category: [T] Signal transduction mechanisms
COG ID: [COG3920] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
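
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record itself. The function names span_length and coding_length_aa are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 860073
END_BP = 861143
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_bp = span_length(START_BP, END_BP)
computed_aa = coding_length_aa(computed_bp)

print("gene length matches record:", computed_bp == GENE_LENGTH_BP)
print("protein length matches record:", computed_aa == PROTEIN_LENGTH_AA)
```

Under these assumptions the coordinates reproduce the listed gene length, and the gene length reproduces the listed protein length once the stop codon is set aside.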


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.677371
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.686603
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCCG ATAATGTGCC TGAACAAGTC GATCGGCTTC TCGGCAATTC CGACCTCGTG 
GAAGCTCTCG AAAACAAACA GTTCAAACGA TTTCTCGATC AAGTGCCGAT CGCTCTCGTG
ATCTCCAGAA TCGGTAGCGG GGGCGAACGG ATAATCTACG CGAATCCCGA ATTCGAGAGG
CTCTCGGGAT TGGCGGTGGC GGAGGTCGAG AACCAGGAAT GGGCCGTGCT TGAAGCAGTG
CCCGCGGCCG CACAGGAGGG AGCATCGCTC GGTCAAGCCG TTACTGCCGG GACCGACCGC
ATCGGAACTT ACAAAAGGAA TGCGGGAGAC TCGACCGCCC TGCTCGACGT CTATTCAAAT
GTGGTCGAAG ACGACGAAGG AACGCCGTGC TTCCGCCTCG TCGCACTTGT GGATGTTACC
GAGCACAAGC AGACCGAACG CGAGGAGCTC GAAAGTCGCA TTAAGGAGCA GGATCTCCTG
CTTCGGGAGC TGCAGCATCG TGTGAAGAAT AATCTTCAGA TGATAACGGC GCTCATCCGT
CTCGAAGCTC GCGGCAACCC GCCTCCCGAC ACGCGCTCTT TCGAAAGGTT GGCGGGCCGT
GTCGAAGCAC TCACCACCCT TTATGACGCC ATGGCTAATG GCGACAGCAG CCAGGAAGTC
GATCTCGGCA CCTATATCGG TCAGATCGCA GCTGCCGTCA TGGCCTCGAA CGCCTGCGAT
GGGGTCAGCC TCGACATGAA AATCGATCCC TATCCGGTTT CCGTCAACGT CGCCATGCCA
ACTGGACTGG TCGTCAACGA GCTGCTCACC AACGCGCTCA AGCATGCTTT CAACGGCCGC
GAAGGAGGAG TAATCACGCT GCGAAGCACT TTTGAGGATG ATGGCTACCG TGTCATCGTT
GCGGACGACG GAATAGGTTT CCCGGACGGA GAGACCTGGC CCAAACACGG CAAGCTTGGC
GAGTTGATCG CGCAGTCGCT TCGCGAAAAT TCCAGGGCTG ATCTCCAGGT GATCTCCACG
CCGGGTCAAG GCACACGCGC AACGATTCGT TTCCGGAACG ACTCCGTATA G
 
Protein sequence
MTSDNVPEQV DRLLGNSDLV EALENKQFKR FLDQVPIALV ISRIGSGGER IIYANPEFER 
LSGLAVAEVE NQEWAVLEAV PAAAQEGASL GQAVTAGTDR IGTYKRNAGD STALLDVYSN
VVEDDEGTPC FRLVALVDVT EHKQTEREEL ESRIKEQDLL LRELQHRVKN NLQMITALIR
LEARGNPPPD TRSFERLAGR VEALTTLYDA MANGDSSQEV DLGTYIGQIA AAVMASNACD
GVSLDMKIDP YPVSVNVAMP TGLVVNELLT NALKHAFNGR EGGVITLRST FEDDGYRVIV
ADDGIGFPDG ETWPKHGKLG ELIAQSLREN SRADLQVIST PGQGTRATIR FRNDSV
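
The correspondence between the gene and protein sequences can be illustrated with a short Python sketch that translates the first line of the gene sequence. It is a sketch under stated assumptions, not the tool that produced this record: the codon table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons, and the special handling of alternative start codons under table 11 is not implemented (this gene begins with ATG, so it makes no difference here). The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Alternative start codons are not treated specially.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGACCTCCG ATAATGTGCC TGAACAAGTC GATCGGCTTC TCGGCAATTC CGACCTCGTG"
print(translate(first_line))
```

The printed residues can be compared against the first two blocks of the protein sequence above; passing the full gene sequence to translate works the same way.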