Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4266 |
Symbol | |
ID | 5705771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4839574 |
End bp | 4842669 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641273685 |
Product | histidine kinase |
Protein accession | YP_001539038 |
Protein GI | 159039785 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.159285 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCACCG GACCGACGAC CCTGCCCGCA GGCGGCGACA TTGACCAGCC GAAGCGGCAC TGGCTGCCCC GGCTTCGCAA CGCACGAATC CGCTCGAAGC TCGCGTTGAT CCTGGTCGTG CCGGTCGCCG CGGTCATCGC GCTGGCAACT GTTCGTCTGG TCACAGTAGG TGAGGGCGCG TATGACGCCA CGCGGGCCAA AGCGCTGACC GAACTCTCCA TCGACATCTC CGCACTCGCC CATGACATGC ACGCCGAACG GATGGCGGCT ACGGTCTATC TCGCCTCGAC GAAAGAGACC GCCGACGCCT ACAACCTGCG GGTGCGCAGC ACCGACGAGC GGGTGCAGGC GTACCGGGAG GAACGCGAGC GGATTGGCGA GGTGCCTTCC GCCGTCAGTG ACCGGTTGGT CGCGATTGAC GAGCACCTGA CGACGCTGGA CGCGGCCCGG CAACAGGTGC TGGACCGACG GCAGATGGCG GTTTCCGAGG CGGCGTTACG GTACGGCGTC ATCCTGGCTG ACCTCGTGGC GTACGGCGAT GGCCTCGCTC AACTCCCCGG TGATGAGCAG CTGGCGGACG CCCGGCGGGC GGTCGCGGCC TTCGGCCGCG CCAAGGCGGC GGTCGCCGAG CAGGAGTCCG TCGCCTACAC CGCGTTGAGC GTCGGCAGCT TCGACGAGGA GCAGTACTCC TCCTTTGTGG CCACCTTGAC CGGTCAGCAG GAGGCGTTGC TCGCCTTCTC ACTCGCGGCC AGCCCGAGTC AACGCTCGTT CGTGGACAGC ACCGTCTCGG GTGACGCGGT CACCCTGGCC GACAAGGTCG CCGCGGACAT CACCCGCTCG GTCGGGCAAC GGTCCCTGGT GAGCGCGGCG GACGCCAGTG CCGCCATCGG TGCCGTCAAC GACCTCATGC GGTGGACCGA GGCCCGCCTT CAGGAGCGGT TGCTGGCCGA CACCGAACAG ACCCGGGCGA ATGTCCTCCG GCAGGCGATC GTCGAGTCGC TGCTGGGGCT GTTGACTCTG ATCATCGCCA TAGCCCTCGC CGTGGTGCTG GCCCGTTCGC TGAACCACTC GCTGCGCCTG TTGCGGGAGG GCGCCCTGGC CGTGGCCCAC CGTGACCTGC CGGAGGCGGT GCGTCGGCTG CAGAGCATGC AGGCCGTTGA CGAGGGCGGG GTCGACGACA TCGTCCGTGA GGTGCGGGAA CCGATCCGGC TCAACAACCA GGACGAGGTC GGTCAGGTCG CGCTCGCCTT CAACGTGGTG CACCGGGAGG CTGTCCGGGT AGCGGCTGAA CAGGCAGCCC TGCGGACCAG CGTTTCGGCC ATGTTCCTCA ACCTGGCCCG ACGGAGTCAG AACCTGGTTG ACCGCATGAT CGGAGAGCTG GACGCGATCG AGCGTGGCGA GGAGGACCCG AAGCGTCTGG CCCGGCTCTT CGAACTGGAC CACCTGGCGA CCCGGATGCG CCGCAACGAC GAGAACCTGC TGGTCCTCGC GGGGGCCGAC TCGACCGTGC CCCGGCGGGA GGACGCTCTG CTGGTGGATG TGTTGCGGGC CGCGCAGTCC GAGGTGGAGC TCTACAACCG GATCGAGTTC GGCACCGTTG ACACCGATGT CTCGGTGGCC GCCCACGCGG TCAACGACGT GGTCCGACTC GTCGCCGAAC TACTCGACAA CGCCACCCGG TTCTCGCCAC CGAACACCAC GGTGGTCGCC GACGGGCGGC GGATCCGCGA CTATGTGCTC ATTCAGGTGG AGGACCGTGG CCTCGGCCTC TCCGACGAGC AACTCGAATC GCTCAACCGG CGGTTGGCCG AGCCATCGAG CGTGGATGTC GCGGCATTCC GGCTGATGGG CCTGGCCGTG GTGAGCCGGC TCGCCGACCG GTACGGCATC CGGGTCGAGC TACGCCGCAA CGTCGAGGGT GGCACGGTCG CCCAGGTGAC TCTGCCAACG GCCACGGTCG TCCTGCCCGT TGGCCGGGGA CCGGCCCAGA TCAGCCGGCC CCGTCAGCCG CTCGCGGTGG AGCAGGGTCC GTCCACCCCG ACCGGCCTGG GCGGTCCGTT GGTCGGTGCC ACCCGGGCTG CCACCCTGCC GGAGCAGCGG CCGCCCGAGC CGGCTCCATG GCAGGCGCCC GAGCCAGCTC CGTGGCAGGC GCCGGAGCAG GCCCGCAGCG CAACGGCACC GGTGCAAGCC GGCGGAATGG GCGGAATGGT CGGGGCGTCG CCAGGCCTCG GTGCTACGGG CCACCCCGGT GATGCGCCAA CCGCGGCCTA TCCGCTTCCC CAGCGGAATC CGTCACGCGA TTCGTCGGCG GCGACCGCTG GCTTCCCCAC CGTGCCAGGC AGTACCCCGC CGTTGACCGA CTACGGATCG ACCGGGGGCC TCGGCGCCGA CCTGGCCGCT ACCGCGTCAT TCGCCTCCAC CCCGCTGGAC GTGCCACCAG CCGCTCCACC AGCCCCTCCG CAAGCCGCTC CACCGGCAGA GGCGCCGATC TTCCGGGAGA TGGAGGCGGT CTGGTTCCGG TCGCACGGCA ACGACGCCAC CGCCATCTTC ACTCGGCCGG ACTTCGACGG CGCGGCCCAA CCACCGGCCC CGGACTGGTC GGCGACGGCA GGTGGCCCGG CCGGGCCACA GCTGCCCACC CGGGTGCCGG GTGCCACGAC GACTCCGTCG CCAGTTGGCC CGCCGCCGTA CACCGCGCCA ACCGGGGCTA CCGCCGCGCC CACGAGCCCC ACCACGGCGT CTCCCGTCGG CCCGCCGCCC GCCGCGACCC CGACCGGCGT CCCGACGGCC GCGTCAGGTG CCCCGGGGGC GTCGGCGACC AGCGCCGACG CGTGGCGCAC CGTTGCGGAC GACGGCTGGA GCCGGGCCAG TCGGGCCGCC GAGCCGGCCA GCGGTGGTAC GACCCGTTCC GGCCTGCCGA AGCGGGTGCC GAAGGCGCAG CTCGTGCCCG GCGGCATCGA GCCGCGGGCC CGGGAACGCA CTCGCCGGAC ACCGGACGAA GTCCGCGGTC TGCTGTCGGC CTATCACCGC GGTGTGCAAC GCGGCCGAGC GGCCGGCTCG GACCCCAACA GCACCTCGAG CAAGGAGACG AGCTGA
|
Protein sequence | MSTGPTTLPA GGDIDQPKRH WLPRLRNARI RSKLALILVV PVAAVIALAT VRLVTVGEGA YDATRAKALT ELSIDISALA HDMHAERMAA TVYLASTKET ADAYNLRVRS TDERVQAYRE ERERIGEVPS AVSDRLVAID EHLTTLDAAR QQVLDRRQMA VSEAALRYGV ILADLVAYGD GLAQLPGDEQ LADARRAVAA FGRAKAAVAE QESVAYTALS VGSFDEEQYS SFVATLTGQQ EALLAFSLAA SPSQRSFVDS TVSGDAVTLA DKVAADITRS VGQRSLVSAA DASAAIGAVN DLMRWTEARL QERLLADTEQ TRANVLRQAI VESLLGLLTL IIAIALAVVL ARSLNHSLRL LREGALAVAH RDLPEAVRRL QSMQAVDEGG VDDIVREVRE PIRLNNQDEV GQVALAFNVV HREAVRVAAE QAALRTSVSA MFLNLARRSQ NLVDRMIGEL DAIERGEEDP KRLARLFELD HLATRMRRND ENLLVLAGAD STVPRREDAL LVDVLRAAQS EVELYNRIEF GTVDTDVSVA AHAVNDVVRL VAELLDNATR FSPPNTTVVA DGRRIRDYVL IQVEDRGLGL SDEQLESLNR RLAEPSSVDV AAFRLMGLAV VSRLADRYGI RVELRRNVEG GTVAQVTLPT ATVVLPVGRG PAQISRPRQP LAVEQGPSTP TGLGGPLVGA TRAATLPEQR PPEPAPWQAP EPAPWQAPEQ ARSATAPVQA GGMGGMVGAS PGLGATGHPG DAPTAAYPLP QRNPSRDSSA ATAGFPTVPG STPPLTDYGS TGGLGADLAA TASFASTPLD VPPAAPPAPP QAAPPAEAPI FREMEAVWFR SHGNDATAIF TRPDFDGAAQ PPAPDWSATA GGPAGPQLPT RVPGATTTPS PVGPPPYTAP TGATAAPTSP TTASPVGPPP AATPTGVPTA ASGAPGASAT SADAWRTVAD DGWSRASRAA EPASGGTTRS GLPKRVPKAQ LVPGGIEPRA RERTRRTPDE VRGLLSAYHR GVQRGRAAGS DPNSTSSKET S
|
| |