Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3764 |
Symbol | |
ID | 5319056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 207968 |
End bp | 210721 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640775577 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_001312510 |
Protein GI | 150375914 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.432635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCGT TAGGGCACGG GCTGCGATCG ACAGAGGCCG TTCGGCCCAC GAAAAGGCTC GACAGCCGGA TCATCGATCT GCCGATCTCT GCCCGCGTGG CCGCGCTTAC GGCGGCCGGA CTGGTCAGCC TTGTTCTTTC ATCCCTTTTT CTCACGCAGG CACTTTATCG CAGTGCCGAG CGCATGGCCG AAACGAGAGA GCTCTTCGAC AGGGCGGCGT CGGCAGCCTC GGCCCATGTG GCGTTCGGCG ATCTCAGATA TTGGCTGACC GACCTGTCCG TCAGCTTGCT GATGAATTCC CAGCGCAACG CCGACGAGGC ACGCGCGCGA CTACAGGTGC AGCTGGAACG GCTTGCAAGG CATGCACCCG TGGAGGTCCG GCAAATCGGC GCCGAGGTCG ACGCTTATAT GGAAACGGCG CTGCAGGCAG CCGACAGCTA CACGCAGGAT AACCGCATCG TCGGCAATGC CCTGCTGGCA AAGGCGCGCA CCCACAGCGG CAAGGTGGAC ACAGACCTCA ACAGGCTGGT CGAACAGGTA AAGGCAGAGG CGGAGGCCGC CCGCAACGAG GTCGTCGTTC AAACCGAGAA GACGGCCACG ACCGCGGCCA TCCTGGTGGG CGGCGTGGTG CTGATCGGCT CGCTGCTCAC GCTCCTCGTC CTGCGCTCGA TCGTGGGCCC TCTGCGGCGC CTCAATCGAG TGATCGGCGA TCTGACGGAA GGGCGCTACG ACGTCGAGAT ACCTCAGGAG GGCGGTGACG AATTTGGGGC GATGGCAAGG ACACTATCGC TTTTCCAGGA AAGCGCGATC GAAAAGAAGA AGCTCGAAGA CGAGGCGGAG CGGCAGCGGC GGACGATCGC GGCCGCGCTC GAGGTGATTT CGGATGGCTT CGTCCTTTAT GATTCCGATG ACAGGATCCT GGTCGCCAAC AGCAAATATT GCGAAATTTT CCCGAGCCAC AAGCCCAATA CGCTTCGCGG CAGGAGCTTC CGCGAAATCG TGGAGCAGAA CCTGGAGACG GGGCAGGTGG ACCTCGAAGG AAAGTCGCCG CAGGAGTGGG TCGACGAGCG CGTAAGGCTT CATAGGGATC CCGCGGGCCT CGTCGACGAG AAGCGGTTTG GCGAAAGATG GGTTCGTATC AGCAAACGCA AAATTCCGGA TGGTGGAACG GTGGCCGTCT ACACCGACAT CACGGAACTC AAGCAGCGGC AGGTCGAACT GGAACGCGCA AAAAGCCATG CGGAATCGGC CAACGAGGCC AAGAGCCAGT TTCTCGCCTC CATGAGCCAC GAGTTGCGCA CGCCTCTCAA CGCGATCATC GGCTACAGCG AAATGCTGAT CGAGGAGGCA CGCGATCACA AGGAAGAGGA GCTCGTGCCG GACCTCGACA AAATAGCCTC GGCGGGACGG CACCTCCTCA CGCTTATCAA CGACATCCTC GATCTGTCGA AGATCGAGGC GAACAAGATG GATGTTTTCC TCGAGACCTT CAGTCTTGCC GAGTTGCTCC GTGATGTTGC CGCAACAGTC GCGCCGCTGA TGGCCAGGAA TGGAAACGCG TTCACGCAGG ATTTCGAGGA CGATCTCGGC AAGATGCATT CCGATCAGAC CAAATTGCGC CAGAACCTCT TCAACCTGCT CAGCAATGCG GCCAAGTTCA CGAAAGGCGG GCGGGTGATC CTGAGGGTAC GGCGGGAACA AAGAGCGGAG CACGACTGGC TCGTGTTCCA AGTCTCCGAT ACCGGCATAG GCATGACGCC GGAGCAGCAG GATAGGCTCT TCAACGCCTT CACCCAGGCC GATGCCTCGA CCACCCGAAA CTATGGCGGC ACCGGGCTGG GTCTGAGCAT CACGCGCAGC TTCTGTCGCT TGATAGGCGG TGTCGTCACG GTGGAGAGCG AAGCGGGAAA AGGCTCGGTA TTCACGATGG AAGTGCCCGC GAAGTGCGAG AGCGAGGTGG ATCGCCCACC ACCCGAGCAG GCAGCGCAGG TTCGGTCCGG ACGCACCGCA CTCATCATCG ACGACGAACC GGCTGCGCGC AATCTCATCG CAAAGGCGCT GGCAGAAGCA GGCCTTGCCG CGATCGAGGC ATCAAACGGA CAAGAAGGCC TGGCCGCCGC CCGAGAGCAT CGGCCCGATG CCATCATTCT CGACATCATC ATGCCGCATC AGGACGGCTG GTCGGTCCTG CGCGCTTTGA AGAACGATCC CGAATTGTGC ACGATCCCGG TGATCCTGGC GACGATCCTG GCGGATCGCG AGCTGGGACT TTCGCTCGGC GCGGTCGAAT ATCTGACGAA GCCCATCGAT ACTGACAAAC TTGTCCGGAC AATCGAGACA TTCGGGGGCG GCAAACACGA CGTTCTCGTC ATCGACGACG ACCAGGGCTC ACGCGAGTTC CTTCGGCGCA TTCTGGTCAA GAGGAAATGG ACCGTACACG AGGCGGGCGA CGGCGTCCGC GGGCTGGAGA TGATGAAGCG GCTTCTGCCG CGTCTCGTCC TGCTGGACCT GCTGATGCCA GAAATGGACG GATTCCAAAC GCTGAGCGAA ATGCAGAGTA TCCCGGAATT GCAGAACATC CCCGTCGTCG TGGTCACTTC GAAAGATCTT TCCGCAAACG AGCTGAAATG GTTGCGCGAT CGGGCGGTCG CAGTCGTGAA CAAAGGCGCA AACAGCCGGG CTCAGCTGGT GGAGGCACTC GAACGCCAGA TCGGATCGGT CGCTTTGATT GGCAAAGAAC TGGCCGAAGG CTAA
|
Protein sequence | MSALGHGLRS TEAVRPTKRL DSRIIDLPIS ARVAALTAAG LVSLVLSSLF LTQALYRSAE RMAETRELFD RAASAASAHV AFGDLRYWLT DLSVSLLMNS QRNADEARAR LQVQLERLAR HAPVEVRQIG AEVDAYMETA LQAADSYTQD NRIVGNALLA KARTHSGKVD TDLNRLVEQV KAEAEAARNE VVVQTEKTAT TAAILVGGVV LIGSLLTLLV LRSIVGPLRR LNRVIGDLTE GRYDVEIPQE GGDEFGAMAR TLSLFQESAI EKKKLEDEAE RQRRTIAAAL EVISDGFVLY DSDDRILVAN SKYCEIFPSH KPNTLRGRSF REIVEQNLET GQVDLEGKSP QEWVDERVRL HRDPAGLVDE KRFGERWVRI SKRKIPDGGT VAVYTDITEL KQRQVELERA KSHAESANEA KSQFLASMSH ELRTPLNAII GYSEMLIEEA RDHKEEELVP DLDKIASAGR HLLTLINDIL DLSKIEANKM DVFLETFSLA ELLRDVAATV APLMARNGNA FTQDFEDDLG KMHSDQTKLR QNLFNLLSNA AKFTKGGRVI LRVRREQRAE HDWLVFQVSD TGIGMTPEQQ DRLFNAFTQA DASTTRNYGG TGLGLSITRS FCRLIGGVVT VESEAGKGSV FTMEVPAKCE SEVDRPPPEQ AAQVRSGRTA LIIDDEPAAR NLIAKALAEA GLAAIEASNG QEGLAAAREH RPDAIILDII MPHQDGWSVL RALKNDPELC TIPVILATIL ADRELGLSLG AVEYLTKPID TDKLVRTIET FGGGKHDVLV IDDDQGSREF LRRILVKRKW TVHEAGDGVR GLEMMKRLLP RLVLLDLLMP EMDGFQTLSE MQSIPELQNI PVVVVTSKDL SANELKWLRD RAVAVVNKGA NSRAQLVEAL ERQIGSVALI GKELAEG
|
| |