Gene Smed_4177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4177 
Symbol 
ID5318586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp653182 
End bp655044 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content66% 
IMG OID640775982 
Producthistidine kinase 
Protein accessionYP_001312915 
Protein GI150376319 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.661658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.571454 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAATA CCGGCATGAG CAAACACGCA GACATGGAAG ATCCGGCGGC GCTTCGCAGC 
CGCGCCCGTC GATCCTGGCT CGTGTTTGCG GCCGTCGCCC TGGTCCTCGT CGCTGCCGGC
CTCCTTTTTG CCCGCGACTA CGGGCGGTCA CGGGCCCTCT TCAGCCTTGC CGGTCAAAGC
CGGATCGACG CCAGCCTCAA AGCCTCGCTT CTCCGGGCGG TCGTGGAGCG GCAGCGCGCC
CTTCCGCTGG TTCTCGCCGA CGACGCGGCC ATTCGTGGCG CGCTGCTTAC CCCGGATCGG
CAATCGCTGG ACCGGATCAA CCGCAAGCTC GAAGGTTTGG CGACGAGCGC CGAAGCGGCG
GTCATCTACC TCATCGACCG GAACGGCACC GCCATCGCCG CCAGCAATTG GCGGGATTCC
ACCAGCTTCG TCGGAAACGA TTATGCCTTC CGCGATTATT TCCGGCTCGC CATGCGCGAC
GGCATGGCCG AGCATTTCGC CATGGGCACA GTAAGCAAAC GGCCCGGGCT CTATATCTCC
CGTCGCGTCG CTGGGCCCGG TGGCCCGCTG GGGGTGATCG TTGCCAAGCT CGAATTCGAC
GGAGTCGAGG CGGATTGGCA GGCCTCCGGC AAGCCGGCCT ATGTCACCGA CCGGCGCGGC
ATCGTCCTCA TCACCAGCAT GCCCTCCTGG CGCTTCATGA CGACCAAGCC GATCGCGCGG
GAGCGCCTTG CCCCAATTCG TGAAAGCCTG CAATTCGGTG ACGCACCGCT GTTGCCGCTC
CCCTTCCGCA AGGTCGAGGC GCGGCCCGAT GGCTCCTCCA CCCTCGACGC CCTGCTGCCG
GGCGAGGCGA ACGCAGCCTT TCTGCGCGTC GAGACAATGG TGCCGTCGAC GAACTGGCGC
CTCGAGCAAT TGTCGCCGCT GAAAGCGCCG CTTGCCGCCG GAGCACGCGA AGCGCAGCTC
ATCACGCTTG CCGCGCTGGT GCCGCTTCTG GGGCTGGCGG CATACCTCCT GCGCCGGCGC
CAGGTGATCG CCATGCGCAG CGCCGAGGAG CGGCTCGCCC GCAGCAAACT CGAGACAAGC
GTCGAGGAAC GGACGCGCGA TCTCCGCATG GCACGCGACC GGCTCGAAAC CGAAATCGCC
GATCACCGCC AGACCACCGA GAAATTGCAG GCAGTCCAGC AGGACCTCGT CCAGGCAAAC
CGGCTGGCGA TCCTCGGCCA GGTCGCAGCC GGCGTCGCCC ATGAGATCAA CCAGCCTGTC
GCCACCATCC GCGCCTATGC GGACAATGCC CGCACATTTC TCGAGCGCGG TCAGAGCGCC
ACCGCAGCCG AAAACATGGA GAGCATCGCC GAACTCACCG AGCGCGTCGG CGCCATTACC
GACGAACTTC GTCGCTTCGC CCGCAAGGGT CATTTCGCCG CCGGGCCGAC CGCGATGAAG
GATGTGATCG AGGGAGCGCT CATGCTGCTC CGCAGCCGTT TTGCCGGGCG GATGGACGCA
ATCCGCATCG ATCTGCCGCC GGATGGCCTT CAGGTATTCG GCAATCGCAT CCGGCTGGAG
CAGGTCCTGA TCAACCTGCT GCAGAACGCG CTCGAGGCGA TCGGCGACAG CGGGAATGGC
GCGATCAAAG TGTCGTGCGA GGAAACGGCC ACCGCCGTAA CGCTCACCGT CGCCGACAAC
GGTCCGGGGA TTCCGGCCGA TGTCCGCGAA GAGCTGTTCA CGCCCTTCAA CACCTCGAAG
GAAGACGGGC TTGGTCTCGG CCTGGCAATC TCCAAGGAGA TCGTCTCCGA CTATGGCGGC
ACGATCGAGG TCGCGAGCAG CCCGTCCGGA ACGACATTTA TCGTAAATCT CATGAAGGCA
TGA
 
Protein sequence
MHNTGMSKHA DMEDPAALRS RARRSWLVFA AVALVLVAAG LLFARDYGRS RALFSLAGQS 
RIDASLKASL LRAVVERQRA LPLVLADDAA IRGALLTPDR QSLDRINRKL EGLATSAEAA
VIYLIDRNGT AIAASNWRDS TSFVGNDYAF RDYFRLAMRD GMAEHFAMGT VSKRPGLYIS
RRVAGPGGPL GVIVAKLEFD GVEADWQASG KPAYVTDRRG IVLITSMPSW RFMTTKPIAR
ERLAPIRESL QFGDAPLLPL PFRKVEARPD GSSTLDALLP GEANAAFLRV ETMVPSTNWR
LEQLSPLKAP LAAGAREAQL ITLAALVPLL GLAAYLLRRR QVIAMRSAEE RLARSKLETS
VEERTRDLRM ARDRLETEIA DHRQTTEKLQ AVQQDLVQAN RLAILGQVAA GVAHEINQPV
ATIRAYADNA RTFLERGQSA TAAENMESIA ELTERVGAIT DELRRFARKG HFAAGPTAMK
DVIEGALMLL RSRFAGRMDA IRIDLPPDGL QVFGNRIRLE QVLINLLQNA LEAIGDSGNG
AIKVSCEETA TAVTLTVADN GPGIPADVRE ELFTPFNTSK EDGLGLGLAI SKEIVSDYGG
TIEVASSPSG TTFIVNLMKA