Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5576 |
Symbol | |
ID | 5319878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 541185 |
End bp | 542774 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640777323 |
Product | histidine kinase |
Protein accession | YP_001314255 |
Protein GI | 150377660 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.44664 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0252292 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAAGTG AAGCCGCGGT GCCCTCGATT TCAATTCCCG ATTTAGAGAC CTTAGCAACA TCAATCCATG ACGATACAGC CTTCGTCGTC TTGACTGAGG AGGTGTTGCG GGGACGGGAT ATCGGGAGGA TCTCGACATT GCTGGCAAAT CAGCCGAGCT GGTCGGACCT CCCCTTTATC ATCCTCACTT CGCGTGGGGG GCCGGATCGC CATTCCAGTG CCCGACTTTC AGAATCACTC GGCAACGTCA CTTTTCTCGA GCGGCCTTTC CACCCCACAA CATTCATTAG CGTCGCTCGC TCGGCTATGA AGGGACGGCG TAGGCAGTTT GAAGCGCGTA CCAGGCTTGA GGAGATCAGT CGCCTCAATG AGACGCTTGA GGAGCGCGTG GCGATACGCA CTGCAGAATT GCAGCGCGCG AACAGAGTTC TCTCTGAGCA GATTGCACAG CGCGAAGATG CCGAAGAGAG ACTGCGGCAA TCTCAAAAAC TTGAAGCCAT CGGCCAGTTG ACTGGCGGCG TTGCCCACGA CTTCAACAAT CTCCTCATGG CAGTGCTCGG CAATCTTGGC CTGCTATCTA AATATGTCTC TCACGACCCG AATGCAGCCC GTCTCCTTGA AGGGGCGACG AGAGGGGCGC AAAGAGGGGC GGCACTTACC CAGAGGCTAC TCGCATTTGG ACGCCGACAG GACCTGACTG TTAGACCCAC AGATATGGTG GGTCTCATCA TTGGTATGGA TGATCTTTTG ATCCGATCAA TAGGCCAGAA TATTGAGCTC GAGAAGCACC TGCCACGGCA GTTACCCAGG GCGCTGATAG ATGCCAACCA AGTCGAACTG GCGCTGCTTA ACCTTGCGAT CAACGCGAGA GATGCAATGC CAAGCGGCGG AAAGCTGGTG CTCTCTGTGA GGCAGGAACG TCTCTCTGCC ACGCGAGGCG AGTTGTGTGC ACGCGAATAT CTTGTCCTTT CGGTTTCGGA CACGGGCCAT GGCATGGACG CGGCAACCCT CAAGAGAGCT ATAGATCCGT TTTTCTCAAC CAAGGGGCCG GGTAAAGGCA CGGGGCTCGG ACTCTCAATG ATCCACGGTG TTGCGGTCCA AATGAACGGC GCGCTGGAAC TTACTAGCGT ACTCAACGAA GGTACGACCG CTGAGTTGTG GTTTCCGGCG ACTTCAGAGG CCACGCTTGA CGAACCGGTC AAACCGCCGG TCGCATCATC CGAAACCGCA AAGTTGCTAC GAGTTTTGTT GGTAGACGAC GACGCACTTA TCGCGATGAG TTCAGTCGAT ATGCTGGTGG ACTTGGGACA TACTGTAACT GAAGCCAATT CGGGCAAAGC GGCTTTGGCG CTGCTTGAGG CGGGTAACGA GTTCGATCTT ATGATCACCG ATTACTCGAT GCCTGGAATG AATGGTGCCG AACTCGCTCG CGCCGCACTA CTGCTTGCAC CGAAAATGCA AATTCTTGTC GCATCTGGGT ATGCGGAACT TCCATCAGGC GCGGGTATCG ATCTTCCCAA GCTTGGAAAA CCATATAGTC AGTCGCAACT TGCGGATGAG ATAAGCAAGT TGTTCGCCGG GGATGAGTAA
|
Protein sequence | MLSEAAVPSI SIPDLETLAT SIHDDTAFVV LTEEVLRGRD IGRISTLLAN QPSWSDLPFI ILTSRGGPDR HSSARLSESL GNVTFLERPF HPTTFISVAR SAMKGRRRQF EARTRLEEIS RLNETLEERV AIRTAELQRA NRVLSEQIAQ REDAEERLRQ SQKLEAIGQL TGGVAHDFNN LLMAVLGNLG LLSKYVSHDP NAARLLEGAT RGAQRGAALT QRLLAFGRRQ DLTVRPTDMV GLIIGMDDLL IRSIGQNIEL EKHLPRQLPR ALIDANQVEL ALLNLAINAR DAMPSGGKLV LSVRQERLSA TRGELCAREY LVLSVSDTGH GMDAATLKRA IDPFFSTKGP GKGTGLGLSM IHGVAVQMNG ALELTSVLNE GTTAELWFPA TSEATLDEPV KPPVASSETA KLLRVLLVDD DALIAMSSVD MLVDLGHTVT EANSGKAALA LLEAGNEFDL MITDYSMPGM NGAELARAAL LLAPKMQILV ASGYAELPSG AGIDLPKLGK PYSQSQLADE ISKLFAGDE
|
| |