Gene Smed_5576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5576 
Symbol 
ID5319878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp541185 
End bp542774 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content55% 
IMG OID640777323 
Producthistidine kinase 
Protein accessionYP_001314255 
Protein GI150377660 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.44664 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0252292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAAGTG AAGCCGCGGT GCCCTCGATT TCAATTCCCG ATTTAGAGAC CTTAGCAACA 
TCAATCCATG ACGATACAGC CTTCGTCGTC TTGACTGAGG AGGTGTTGCG GGGACGGGAT
ATCGGGAGGA TCTCGACATT GCTGGCAAAT CAGCCGAGCT GGTCGGACCT CCCCTTTATC
ATCCTCACTT CGCGTGGGGG GCCGGATCGC CATTCCAGTG CCCGACTTTC AGAATCACTC
GGCAACGTCA CTTTTCTCGA GCGGCCTTTC CACCCCACAA CATTCATTAG CGTCGCTCGC
TCGGCTATGA AGGGACGGCG TAGGCAGTTT GAAGCGCGTA CCAGGCTTGA GGAGATCAGT
CGCCTCAATG AGACGCTTGA GGAGCGCGTG GCGATACGCA CTGCAGAATT GCAGCGCGCG
AACAGAGTTC TCTCTGAGCA GATTGCACAG CGCGAAGATG CCGAAGAGAG ACTGCGGCAA
TCTCAAAAAC TTGAAGCCAT CGGCCAGTTG ACTGGCGGCG TTGCCCACGA CTTCAACAAT
CTCCTCATGG CAGTGCTCGG CAATCTTGGC CTGCTATCTA AATATGTCTC TCACGACCCG
AATGCAGCCC GTCTCCTTGA AGGGGCGACG AGAGGGGCGC AAAGAGGGGC GGCACTTACC
CAGAGGCTAC TCGCATTTGG ACGCCGACAG GACCTGACTG TTAGACCCAC AGATATGGTG
GGTCTCATCA TTGGTATGGA TGATCTTTTG ATCCGATCAA TAGGCCAGAA TATTGAGCTC
GAGAAGCACC TGCCACGGCA GTTACCCAGG GCGCTGATAG ATGCCAACCA AGTCGAACTG
GCGCTGCTTA ACCTTGCGAT CAACGCGAGA GATGCAATGC CAAGCGGCGG AAAGCTGGTG
CTCTCTGTGA GGCAGGAACG TCTCTCTGCC ACGCGAGGCG AGTTGTGTGC ACGCGAATAT
CTTGTCCTTT CGGTTTCGGA CACGGGCCAT GGCATGGACG CGGCAACCCT CAAGAGAGCT
ATAGATCCGT TTTTCTCAAC CAAGGGGCCG GGTAAAGGCA CGGGGCTCGG ACTCTCAATG
ATCCACGGTG TTGCGGTCCA AATGAACGGC GCGCTGGAAC TTACTAGCGT ACTCAACGAA
GGTACGACCG CTGAGTTGTG GTTTCCGGCG ACTTCAGAGG CCACGCTTGA CGAACCGGTC
AAACCGCCGG TCGCATCATC CGAAACCGCA AAGTTGCTAC GAGTTTTGTT GGTAGACGAC
GACGCACTTA TCGCGATGAG TTCAGTCGAT ATGCTGGTGG ACTTGGGACA TACTGTAACT
GAAGCCAATT CGGGCAAAGC GGCTTTGGCG CTGCTTGAGG CGGGTAACGA GTTCGATCTT
ATGATCACCG ATTACTCGAT GCCTGGAATG AATGGTGCCG AACTCGCTCG CGCCGCACTA
CTGCTTGCAC CGAAAATGCA AATTCTTGTC GCATCTGGGT ATGCGGAACT TCCATCAGGC
GCGGGTATCG ATCTTCCCAA GCTTGGAAAA CCATATAGTC AGTCGCAACT TGCGGATGAG
ATAAGCAAGT TGTTCGCCGG GGATGAGTAA
 
Protein sequence
MLSEAAVPSI SIPDLETLAT SIHDDTAFVV LTEEVLRGRD IGRISTLLAN QPSWSDLPFI 
ILTSRGGPDR HSSARLSESL GNVTFLERPF HPTTFISVAR SAMKGRRRQF EARTRLEEIS
RLNETLEERV AIRTAELQRA NRVLSEQIAQ REDAEERLRQ SQKLEAIGQL TGGVAHDFNN
LLMAVLGNLG LLSKYVSHDP NAARLLEGAT RGAQRGAALT QRLLAFGRRQ DLTVRPTDMV
GLIIGMDDLL IRSIGQNIEL EKHLPRQLPR ALIDANQVEL ALLNLAINAR DAMPSGGKLV
LSVRQERLSA TRGELCAREY LVLSVSDTGH GMDAATLKRA IDPFFSTKGP GKGTGLGLSM
IHGVAVQMNG ALELTSVLNE GTTAELWFPA TSEATLDEPV KPPVASSETA KLLRVLLVDD
DALIAMSSVD MLVDLGHTVT EANSGKAALA LLEAGNEFDL MITDYSMPGM NGAELARAAL
LLAPKMQILV ASGYAELPSG AGIDLPKLGK PYSQSQLADE ISKLFAGDE