Gene Hlac_0543 details


Gene Information

Locus tag: Hlac_0543
Symbol:
ID: 7401678
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 564523
End bp: 565644
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 69%
IMG OID: 643707608
Product: transcriptional regulator, TrmB
Protein accession: YP_002565215
Protein GI: 222478978
COG category: [K] Transcription
COG ID: [COG1378] Predicted transcriptional regulators
TIGRFAM ID:
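
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
length = gene_length_bp(564523, 565644)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1122 373
```

Both results agree with the listed Gene Length (1122 bp) and Protein Length (373 aa).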


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0100454
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGACC GGACGCTGAA CGATCTTCTC CGCCGGTTCG GGCTCTCGGA CAAGGAGGTC 
GACACGTACC TCAGTCTCTT GGCGCACGGG GAGGCGAAGG CGAGCACCGT CGCAGACGCC
GCCGGTGTGT CGAAGCGCTA CGTCTACAGC GTGAGCGAGT CGCTCGCTGA GCGCGGCTTC
GTCGAGGTAA ACGACCACGT CGTGCCGACG ACGATCCGCG CGAACCCGCC GGACGAGGTC
ATCAACCGCC TCCGTTCGGA CGTCGACGCG ATCCGCCCCG GGTTAGAGGA GCGCTTCTCG
CGGGTGGAGC GGCAGACCGA GCAGTTCGAG GTGATCAAGT CCCGCGTGAC GGTGATAAAG
CGGATCCGGT CGCTGCTCGC GGACGCGGAC TCGGAGGTGA CGCTGTCGAT CGCGGCCGGT
CACCTCCCCG AGATCCGCGA CTCCCTCGTC GAGGCGGTCG ACCGCGGCGT CTTGGTACTG
CTCATCGTCT CCGGCGCCGA CGAGGTGCCG GACGACATCG ATGAGGGACT CGACGGCGTC
GCCAGCGTCG TCCGGACGTG GCGCGAGGCG ATGCCGACGC TGCTCACGGT CGACTCCGCG
GCCGGCGTCG TCGCCCCGCC CGAACTGCTG CGCCGGTCCA ACACCGACCG GCAGGCGATC
CACTTCTCAC AGGAACAGCT CGCGCCGGTG ATCGTCGGCT CGTTCCTCGG GAACTACTGG
CCGGCCGCGA ACGAGATCGC GACCGCGGCG CCCGCGCCGC TCCCGGTCGA GTACGCGAAC
TTCAGACACA CCGTACTGCA GGTGACCCTG CGCCTCCGCG TCGGCGAGAT TCCCCGCGTC
ACCGTGGGCG GCCGGTGGAC TGACCGCGAC GAGCCGGCCG AGATCAGCGG TCGCGTCGTG
GAGTCGAAAC AGGGAATGGT GGAGCCGACG ACCAACGAGT TCCCAGTCCA ACACTCGCTC
GTCGTCGAGA CCGACGACGG CAAGACCGTC ACGGTGGGCG GGCAGGGGGC CTTCGTTGAG
GACATCGAGG CCGACCTCGT TCGGATCGAG GAAGACGACG GAGACCACGA GGAGGCGGAC
GCGGGAGAGT CCGACGAGGC GGACGGAGCA GACGGCGTCT GA
 
Protein sequence
MDDRTLNDLL RRFGLSDKEV DTYLSLLAHG EAKASTVADA AGVSKRYVYS VSESLAERGF 
VEVNDHVVPT TIRANPPDEV INRLRSDVDA IRPGLEERFS RVERQTEQFE VIKSRVTVIK
RIRSLLADAD SEVTLSIAAG HLPEIRDSLV EAVDRGVLVL LIVSGADEVP DDIDEGLDGV
ASVVRTWREA MPTLLTVDSA AGVVAPPELL RRSNTDRQAI HFSQEQLAPV IVGSFLGNYW
PAANEIATAA PAPLPVEYAN FRHTVLQVTL RLRVGEIPRV TVGGRWTDRD EPAEISGRVV
ESKQGMVEPT TNEFPVQHSL VVETDDGKTV TVGGQGAFVE DIEADLVRIE EDDGDHEEAD
AGESDEADGA DGV
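
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block could be cleaned, checked for GC content, and translated. The helper names are hypothetical. The codon table is built from the standard sense-codon assignments, which translation table 11 shares with the standard code; table 11 differs in which codons may initiate translation, and that is not modelled here because this gene starts with ATG.

```python
# Standard amino-acid string for codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = clean_sequence("ATGGACGACC GGACGCTGAA CGATCTTCTC")
print(translate(first_block))  # MDDRTLNDLL
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and protein sequence.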