Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1004 |
Symbol | |
ID | 7401899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 999136 |
End bp | 1001901 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643708070 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_002565671 |
Protein GI | 222479434 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.627886 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGACC GAATCCGCGT GCTCCACGTC GACGACGACC CGGATCTCGC GGATATCACC GCGTTGTTCC TCGAACGCGA GGATCCTCGA ATCACGGTCG AGACGGTCTC AAACGCCACC GAAGGGCTCG AACGCCTCGA CGATCTCGAC CTCGACGCCA ATGTCGACTG TATCGTCTCC GACCACGACA TGCCGGGACC GAACGGGATC GAGTTCCTGG AGGCGGTCCG GGAGCGCGAC CCGGATCTCC CCTTCATCCT CTACACCGGG AAGGGGTCGG AGTCGGTCGC CAGCGAGGCC ATCTCCGCGG GCGTGACGGA CTACCTCCAG AAGGGCGGCG GAACCGAGCA GTACGAGATT CTGGCGAACC GCATCGTCGA CGCAGCCGAG AAGCGCCGGA TCGAGCGCGA GGCCGACCGG ACACAGGCCC ACCTGCGGGC GATCACCGAT CACTCGATGG ACGCCATCCT CACCATCGAC GGCGACAGCC GAATTCGGTT CGCGAACCCG GCCGTCGAGC GGCTGTTCGG CTACGCGCCC GCGGAGGTCG AAGGCGAGCC GCTGGCAACG CTGATGCCGG AGCGGAAGCG CGAGGCGCAC CGTGAGGCTG TCGAGCGGTA CTGCGCGACC GAGAAGCGCT CGATAGACTG GTCGGCGGCC GAATTTCCCG GGAAGCACTG GGACGGCCGT GAGATCCCCC TGTCGATATC TTTCGGGGAG TTCGAGGAGG ACGGCGAGCG ACGGTTCGTC GGCATCATCC GGGACGTGAC AGAACGCGAG CGACACCGCG CGTTCGTCGA GCGCTCCAGC GACATCGTCA CCGCGCTCGA CGCGAACGGG ACGTTCCAGT ACGCGAGCCC CTCGGTGGAG CGGATCCTCG GCCACGACCC GGCGGACCTG GTCGGTGAGT ACGCCTTCGA GTACGTCCAC CCGGAAGACC GCGAGCGAGT CGTCGAGGTG TTCGCGCAGT CGGTCACGGG CGACGAGCCG AACCCGACGG TGGAATACCG GCTCGCGGAT GCGGACGGAG GGTACCTGTG GGTGGAGTCG GTCGGGAGTA ACCGCCTCGA CGACCACGGA GTCAGCGGAT TCGTGATCAA CACACGCGAC ATCTCCGAGC GGAAGCAGCG CGAGGAGAAG CTGTCGCGGC TCCGCGAGTG GACGCGGGAC CTCAATTACA CGCGAACAGT CGCGGAGACG ACGCAGCTGG CCGTCGACGC CGCGGACGAA ATCGTCGGTG CGGGGCTGAG CGGGATCCAC CTGGTGAACG AGGCGGGCGA TGCGCTCGAA CCCGCCGCGC TCGCCGAGTC GGTGCCGTCG TTCTTCGACG AACAGCCCTC CTACGATCGG GACTCCCCGC CCGGGTCGCG CGCGGCCCTC GCGTGGGACG CGTACAGCGG CGACGAGCCG CTCTCCGTCG GCGACCTGTC GGCGTACGAT CGTCTCGACG AGGAGACGCC CGCCCAGAGC ATCGTGCTCC ATCCGATCGG CGACCACGGG CTGTTCGTCA TCTCCTCGTC GGAGCCGCAC GCGTTCACCG AGACTGACGT GCTCATCGCG GAGATCCTCG CGAACCATCT CGAAGCCGCG TTAGACCGGG TTGCTCGCGA GACCAGCTTA GAGCGGCTCC ACGACGCGAC CCGAAGTCTC ATCCAGGCTG ACTCCCCGAA GGAGATCGCC GAGCGCGTCG TCGAGGTGCT CGGGTTCTCC GTGGTCACCG TTCGGCTGTC TGACGAGGAC GCCGGCGGGC TCGTCCCCGT CGCGGTCTCG GAGGGGGTCA AAGAGGTATT GCCCGTGCGG AAGGTGTTCA CCCCTGACGG CGGGAGTCTC AACTGGGAGG CGTTCGAGGC GGGCGAACCC CGGATGTACG ACGACATCGA GACGGCGGGC GCGCTCGACA CTGGAACGGG GCTCCGGAGC CTGATGATCC TCCCCGTCGG CGAGCACGGA ACCATCTCTG TCGGCGAGAC CGAGCCCGGC GTGTTCGACG GGACCGACGA GCATCTCGCA CAGATCCTCG CGACGGCAGC CGAGACGGCG CTCAACGCGG CTGCGCGGAC CAGCCGTCTC CACGATCGGA GCGCGGAACT GGAGCGACAG AACGATCGGC TCGCGGAGTT CGCCAGCGTC GTCTCTCACG ACCTCCGGAA CCCGCTGAAC GTGGCGCAGG GACGGGTACA GCTGGCTCGC GACGAGTGTG ACAGTGAGAA CCTCGACGCC GCCGCGCGCG CCCACGAGCG GATGGACACG CTGATCGCCG ACCTGCTCAC GCTCGCTCGC GAGGGCGAGC GGGTGAGCGA GACGGAGTCG GTCCGGCTCT CGGTCGTCGT CGAGTCCTGC TGGGAGACGG TCGAGACCGC GAATGCGACG CTGGCCGTCG AGGAGGACCT GTGGCTCCGG GCCGACGAGA GCCGCTTCAG ACAGCTCGTC GAGAATCTGG TGCGGAACGC GGTCGAACAC GGCGGCGACG ACGTGTTGAT CACTGTCGGC GCTCTCGGCG GGAACAGAAA CGGCAGCGAA AACGGGTCCG TGACCGAGAC GAGCGACGAG GTCGGGTTCT TCGTCGAGGA CGACGGGCCC GGGATCCCGG AAGCGAACCG CGACGAGGTG TTCGACGCCG GTTACTCGAC CTCGCGAGAG GGGACCGGCT TCGGCCTCCG GATCGTCGAA CAGGTGGCGA CGGCCCACGG CTGGTCGGTC CGGGTCACGG AGGGTCGCGA CGGGGGCGCC CGGTTCGAGG TGACGGGCGT GGAGCCGGCT GAGTGA
|
Protein sequence | MGDRIRVLHV DDDPDLADIT ALFLEREDPR ITVETVSNAT EGLERLDDLD LDANVDCIVS DHDMPGPNGI EFLEAVRERD PDLPFILYTG KGSESVASEA ISAGVTDYLQ KGGGTEQYEI LANRIVDAAE KRRIEREADR TQAHLRAITD HSMDAILTID GDSRIRFANP AVERLFGYAP AEVEGEPLAT LMPERKREAH REAVERYCAT EKRSIDWSAA EFPGKHWDGR EIPLSISFGE FEEDGERRFV GIIRDVTERE RHRAFVERSS DIVTALDANG TFQYASPSVE RILGHDPADL VGEYAFEYVH PEDRERVVEV FAQSVTGDEP NPTVEYRLAD ADGGYLWVES VGSNRLDDHG VSGFVINTRD ISERKQREEK LSRLREWTRD LNYTRTVAET TQLAVDAADE IVGAGLSGIH LVNEAGDALE PAALAESVPS FFDEQPSYDR DSPPGSRAAL AWDAYSGDEP LSVGDLSAYD RLDEETPAQS IVLHPIGDHG LFVISSSEPH AFTETDVLIA EILANHLEAA LDRVARETSL ERLHDATRSL IQADSPKEIA ERVVEVLGFS VVTVRLSDED AGGLVPVAVS EGVKEVLPVR KVFTPDGGSL NWEAFEAGEP RMYDDIETAG ALDTGTGLRS LMILPVGEHG TISVGETEPG VFDGTDEHLA QILATAAETA LNAAARTSRL HDRSAELERQ NDRLAEFASV VSHDLRNPLN VAQGRVQLAR DECDSENLDA AARAHERMDT LIADLLTLAR EGERVSETES VRLSVVVESC WETVETANAT LAVEEDLWLR ADESRFRQLV ENLVRNAVEH GGDDVLITVG ALGGNRNGSE NGSVTETSDE VGFFVEDDGP GIPEANRDEV FDAGYSTSRE GTGFGLRIVE QVATAHGWSV RVTEGRDGGA RFEVTGVEPA E
|
| |