Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1231 |
Symbol | |
ID | 7399499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1241385 |
End bp | 1242515 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643708295 |
Product | histidine kinase |
Protein accession | YP_002565893 |
Protein GI | 222479656 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000541094 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGTGACG ACGCTCTCAG CGCGATGGAA GCCAGAGAGC GGCTCTACGA GGTGATGGAC CGAGATGTCC CTTTTGAGGA GAAGGCGACG CTGGCGCTCT CCATCGGGGA GGCGTACCTG GGCGTGGAGA ACGGACATCT GACGCGGATC GACGTCGAGA GCGACTACTG GAAGGCGATC GCCAGTACGG ACTCGGTGGA CGGACGGTTC CCGGTCGGGC TCCAGTTAGA CCTCCAGAAC ACGTACTGCC GGCGAACGAT CGACGGCGAG AGCCCGGTTC GACTGTACGA CGCACCGAAC CAGGGGTGGG ACGACGACCC CGCGTTCGAG CGGCACGGAC TCCACTGTTA CCACGGCAGT ACGATCACCA TCGACGACGG GGTCTGCGGA ACGCTGTGTT TCGTCTCCAC GGAGCCGCGT CCAGAGCCGT TCGCCGACGA GGAGACGCTG TTCGCTGAGC TTATCGCGCG GCTGTTGGAA ACGGAGCTCC AAGCGGAACG GACGGAGGCA AAGATCGACC GGCTCGATCA GTTCGCCAGC GTCGTCTCCC ACGACCTCAG AAGCCCGTTG AACGTCGCAC AGGGCCGCGT CGATCTCGAA CGATCGACCC GCGACAGCGA CCACTTGGGG ATCGCCGCGA GATCGTTGGA CCGGATGGAG GACCTGATCG CCGACGTTCT CACGGTGGCC CGACAGGGAC AGGAGATCGG GGACACGGAA CTCGTCTCGC TCGACGCGAT CGTCACAGAG TGCTGGGACG CAGTGCAAAC CGACGGCGCG AACGTGACCG TTACCGACGA TCTGTGGTTT AAGGCGGACC GAGGGCGCGT CCGCCACCTC TTCGAGAACC TGTTCCGGAA CGGTGTCGAG CACGCGGGCC CGGACGTGTC GATCCGCGTC GGACCGCTCG ACAACGGAGA CGGATTCTAC GTCGAGGACG ACGGCCCCGG AATTCCGGCG GCCGATCGGG AGCAGGTCTT CGAGTCGGGG TACACGACGG GCACGGACGG GCTCGGACTC GGCCTCTCGA TCGTCGGGGG CGTCGTCGAC GCTCACGGGT GGACGATCGC GGTCGGAGCG GGCGCCGACG GCGGCGCGCG ATTCGAGGTT TCCGGCGTGG TCGTTCCGTA G
|
Protein sequence | MGDDALSAME ARERLYEVMD RDVPFEEKAT LALSIGEAYL GVENGHLTRI DVESDYWKAI ASTDSVDGRF PVGLQLDLQN TYCRRTIDGE SPVRLYDAPN QGWDDDPAFE RHGLHCYHGS TITIDDGVCG TLCFVSTEPR PEPFADEETL FAELIARLLE TELQAERTEA KIDRLDQFAS VVSHDLRSPL NVAQGRVDLE RSTRDSDHLG IAARSLDRME DLIADVLTVA RQGQEIGDTE LVSLDAIVTE CWDAVQTDGA NVTVTDDLWF KADRGRVRHL FENLFRNGVE HAGPDVSIRV GPLDNGDGFY VEDDGPGIPA ADREQVFESG YTTGTDGLGL GLSIVGGVVD AHGWTIAVGA GADGGARFEV SGVVVP
|
| |