Gene Information |
Locus tag | Hlac_0543 |
Symbol | |
ID | 7401678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 564523 |
End bp | 565644 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643707608 |
Product | transcriptional regulator, TrmB |
Protein accession | YP_002565215 |
Protein GI | 222478978 |
COG category | [K] Transcription |
COG ID | [COG1378] Predicted transcriptional regulators |
TIGRFAM ID | |
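The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based, inclusive coordinates and that the gene length includes one terminal stop codon. Both are assumptions that reproduce the listed values, not statements taken from the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in this record.
length_bp = gene_length(564523, 565644)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1122 373
```

Under these assumptions the result matches the Gene Length (1122 bp) and Protein Length (373 aa) fields.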
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0100454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACGACC GGACGCTGAA CGATCTTCTC CGCCGGTTCG GGCTCTCGGA CAAGGAGGTC GACACGTACC TCAGTCTCTT GGCGCACGGG GAGGCGAAGG CGAGCACCGT CGCAGACGCC GCCGGTGTGT CGAAGCGCTA CGTCTACAGC GTGAGCGAGT CGCTCGCTGA GCGCGGCTTC GTCGAGGTAA ACGACCACGT CGTGCCGACG ACGATCCGCG CGAACCCGCC GGACGAGGTC ATCAACCGCC TCCGTTCGGA CGTCGACGCG ATCCGCCCCG GGTTAGAGGA GCGCTTCTCG CGGGTGGAGC GGCAGACCGA GCAGTTCGAG GTGATCAAGT CCCGCGTGAC GGTGATAAAG CGGATCCGGT CGCTGCTCGC GGACGCGGAC TCGGAGGTGA CGCTGTCGAT CGCGGCCGGT CACCTCCCCG AGATCCGCGA CTCCCTCGTC GAGGCGGTCG ACCGCGGCGT CTTGGTACTG CTCATCGTCT CCGGCGCCGA CGAGGTGCCG GACGACATCG ATGAGGGACT CGACGGCGTC GCCAGCGTCG TCCGGACGTG GCGCGAGGCG ATGCCGACGC TGCTCACGGT CGACTCCGCG GCCGGCGTCG TCGCCCCGCC CGAACTGCTG CGCCGGTCCA ACACCGACCG GCAGGCGATC CACTTCTCAC AGGAACAGCT CGCGCCGGTG ATCGTCGGCT CGTTCCTCGG GAACTACTGG CCGGCCGCGA ACGAGATCGC GACCGCGGCG CCCGCGCCGC TCCCGGTCGA GTACGCGAAC TTCAGACACA CCGTACTGCA GGTGACCCTG CGCCTCCGCG TCGGCGAGAT TCCCCGCGTC ACCGTGGGCG GCCGGTGGAC TGACCGCGAC GAGCCGGCCG AGATCAGCGG TCGCGTCGTG GAGTCGAAAC AGGGAATGGT GGAGCCGACG ACCAACGAGT TCCCAGTCCA ACACTCGCTC GTCGTCGAGA CCGACGACGG CAAGACCGTC ACGGTGGGCG GGCAGGGGGC CTTCGTTGAG GACATCGAGG CCGACCTCGT TCGGATCGAG GAAGACGACG GAGACCACGA GGAGGCGGAC GCGGGAGAGT CCGACGAGGC GGACGGAGCA GACGGCGTCT GA
Protein sequence | MDDRTLNDLL RRFGLSDKEV DTYLSLLAHG EAKASTVADA AGVSKRYVYS VSESLAERGF VEVNDHVVPT TIRANPPDEV INRLRSDVDA IRPGLEERFS RVERQTEQFE VIKSRVTVIK RIRSLLADAD SEVTLSIAAG HLPEIRDSLV EAVDRGVLVL LIVSGADEVP DDIDEGLDGV ASVVRTWREA MPTLLTVDSA AGVVAPPELL RRSNTDRQAI HFSQEQLAPV IVGSFLGNYW PAANEIATAA PAPLPVEYAN FRHTVLQVTL RLRVGEIPRV TVGGRWTDRD EPAEISGRVV ESKQGMVEPT TNEFPVQHSL VVETDDGKTV TVGGQGAFVE DIEADLVRIE EDDGDHEEAD AGESDEADGA DGV
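The sequences above are printed in space-separated groups of 10 characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction. The function names are hypothetical, and the example uses only the first three groups of the gene sequence, so its result is not the GC content of the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character groups."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three groups of the gene sequence in this record, for illustration only.
fragment = clean_sequence("ATGGACGACC GGACGCTGAA CGATCTTCTC")
print(len(fragment), round(gc_fraction(fragment), 3))
```

Applied to the full gene sequence, the same function should give a value comparable to the GC content field, although the record does not state how that percentage was computed or rounded.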