Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4177 |
Symbol | |
ID | 5318586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 653182 |
End bp | 655044 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640775982 |
Product | histidine kinase |
Protein accession | YP_001312915 |
Protein GI | 150376319 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.661658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.571454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAATA CCGGCATGAG CAAACACGCA GACATGGAAG ATCCGGCGGC GCTTCGCAGC CGCGCCCGTC GATCCTGGCT CGTGTTTGCG GCCGTCGCCC TGGTCCTCGT CGCTGCCGGC CTCCTTTTTG CCCGCGACTA CGGGCGGTCA CGGGCCCTCT TCAGCCTTGC CGGTCAAAGC CGGATCGACG CCAGCCTCAA AGCCTCGCTT CTCCGGGCGG TCGTGGAGCG GCAGCGCGCC CTTCCGCTGG TTCTCGCCGA CGACGCGGCC ATTCGTGGCG CGCTGCTTAC CCCGGATCGG CAATCGCTGG ACCGGATCAA CCGCAAGCTC GAAGGTTTGG CGACGAGCGC CGAAGCGGCG GTCATCTACC TCATCGACCG GAACGGCACC GCCATCGCCG CCAGCAATTG GCGGGATTCC ACCAGCTTCG TCGGAAACGA TTATGCCTTC CGCGATTATT TCCGGCTCGC CATGCGCGAC GGCATGGCCG AGCATTTCGC CATGGGCACA GTAAGCAAAC GGCCCGGGCT CTATATCTCC CGTCGCGTCG CTGGGCCCGG TGGCCCGCTG GGGGTGATCG TTGCCAAGCT CGAATTCGAC GGAGTCGAGG CGGATTGGCA GGCCTCCGGC AAGCCGGCCT ATGTCACCGA CCGGCGCGGC ATCGTCCTCA TCACCAGCAT GCCCTCCTGG CGCTTCATGA CGACCAAGCC GATCGCGCGG GAGCGCCTTG CCCCAATTCG TGAAAGCCTG CAATTCGGTG ACGCACCGCT GTTGCCGCTC CCCTTCCGCA AGGTCGAGGC GCGGCCCGAT GGCTCCTCCA CCCTCGACGC CCTGCTGCCG GGCGAGGCGA ACGCAGCCTT TCTGCGCGTC GAGACAATGG TGCCGTCGAC GAACTGGCGC CTCGAGCAAT TGTCGCCGCT GAAAGCGCCG CTTGCCGCCG GAGCACGCGA AGCGCAGCTC ATCACGCTTG CCGCGCTGGT GCCGCTTCTG GGGCTGGCGG CATACCTCCT GCGCCGGCGC CAGGTGATCG CCATGCGCAG CGCCGAGGAG CGGCTCGCCC GCAGCAAACT CGAGACAAGC GTCGAGGAAC GGACGCGCGA TCTCCGCATG GCACGCGACC GGCTCGAAAC CGAAATCGCC GATCACCGCC AGACCACCGA GAAATTGCAG GCAGTCCAGC AGGACCTCGT CCAGGCAAAC CGGCTGGCGA TCCTCGGCCA GGTCGCAGCC GGCGTCGCCC ATGAGATCAA CCAGCCTGTC GCCACCATCC GCGCCTATGC GGACAATGCC CGCACATTTC TCGAGCGCGG TCAGAGCGCC ACCGCAGCCG AAAACATGGA GAGCATCGCC GAACTCACCG AGCGCGTCGG CGCCATTACC GACGAACTTC GTCGCTTCGC CCGCAAGGGT CATTTCGCCG CCGGGCCGAC CGCGATGAAG GATGTGATCG AGGGAGCGCT CATGCTGCTC CGCAGCCGTT TTGCCGGGCG GATGGACGCA ATCCGCATCG ATCTGCCGCC GGATGGCCTT CAGGTATTCG GCAATCGCAT CCGGCTGGAG CAGGTCCTGA TCAACCTGCT GCAGAACGCG CTCGAGGCGA TCGGCGACAG CGGGAATGGC GCGATCAAAG TGTCGTGCGA GGAAACGGCC ACCGCCGTAA CGCTCACCGT CGCCGACAAC GGTCCGGGGA TTCCGGCCGA TGTCCGCGAA GAGCTGTTCA CGCCCTTCAA CACCTCGAAG GAAGACGGGC TTGGTCTCGG CCTGGCAATC TCCAAGGAGA TCGTCTCCGA CTATGGCGGC ACGATCGAGG TCGCGAGCAG CCCGTCCGGA ACGACATTTA TCGTAAATCT CATGAAGGCA TGA
|
Protein sequence | MHNTGMSKHA DMEDPAALRS RARRSWLVFA AVALVLVAAG LLFARDYGRS RALFSLAGQS RIDASLKASL LRAVVERQRA LPLVLADDAA IRGALLTPDR QSLDRINRKL EGLATSAEAA VIYLIDRNGT AIAASNWRDS TSFVGNDYAF RDYFRLAMRD GMAEHFAMGT VSKRPGLYIS RRVAGPGGPL GVIVAKLEFD GVEADWQASG KPAYVTDRRG IVLITSMPSW RFMTTKPIAR ERLAPIRESL QFGDAPLLPL PFRKVEARPD GSSTLDALLP GEANAAFLRV ETMVPSTNWR LEQLSPLKAP LAAGAREAQL ITLAALVPLL GLAAYLLRRR QVIAMRSAEE RLARSKLETS VEERTRDLRM ARDRLETEIA DHRQTTEKLQ AVQQDLVQAN RLAILGQVAA GVAHEINQPV ATIRAYADNA RTFLERGQSA TAAENMESIA ELTERVGAIT DELRRFARKG HFAAGPTAMK DVIEGALMLL RSRFAGRMDA IRIDLPPDGL QVFGNRIRLE QVLINLLQNA LEAIGDSGNG AIKVSCEETA TAVTLTVADN GPGIPADVRE ELFTPFNTSK EDGLGLGLAI SKEIVSDYGG TIEVASSPSG TTFIVNLMKA
|
| |