Gene Hlac_0107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0107 
Symbol 
ID7401625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp112378 
End bp113943 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content64% 
IMG OID643707168 
ProductDNA-directed RNA polymerase subunit beta'' 
Protein accessionYP_002564783 
Protein GI222478546 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.397905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.532786 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGGC AAGACCGACG CGTGGTGTCA CGTGAGTATT TCTCGGACGA ACGGCTCGCC 
GAACACCACT TCCGATCGTT CAACAACTTC CTGGACCGCG GCATGCAGGA GGTCGTCGAC
GAGAAGGAGA CGATCGAGAC CGATATCGGC GACAAAGAGG GCCAAGAGCC CGTCTACGTC
GAACTCGGCG ATGTGCGGAT GGTCACTCCG CGCGTCCGCG AGGCCGACGG CTCCGAAGAG
CTGCTGTACC CCCAAGAGGC CCGGCTTCGG AACATCACCT ACTCGGCGCC CGTGTTCATG
GAGATGTCCA TCGTGCGCGG CGGCGAAGAC GAGCCCGAGC AGGTCGTCGA CACGACCGAG
ACGAAGGTCG GCCGGATGCC GATCATGGTC GGCTCGAACA AGTGCAACAT GGCCGGCTTC
TCCGACGACG AGCTCATCGA CATCGGTGAA GATCCCGTCG ACCCCGGCGG CTACTTCATC
GTCAACGGCT CCGAGCGCGT GCTGATGACC TCGGAGGACC TCGCGCCGAA CAAGATCCTC
GCCGAGTACG ACTCGAAGTA CGGCGACGAG ATCCAGGTCG CAAAGACGTT CTCCCAACGC
CGCGGGTACC GTGCGCTGGT GCTTTGCGAG CGCAACCGTG AAGGGCTGCT CGAAGTGTCG
TTCCCGTCCG TCTCGGGCTC GATTGACTTC GTGACCCTCG TTCGCGCCCT CGGGCTCGAA
TCCGACGAGG AGATCGTTCA CCGCGTCTCG GACGACCCCG AGATTGTGAA GTTCATGCTG
GAGAACTTGG AGGAGGCCGA GGTGCAGACG ACCGAGGGGG CCATCGAAAC CCTCGGCGAG
CGCGTCGCCT CCGGACAGGG GAAGAACTAC CAGCTCAAGC GGGCCAACTA CGTCATCGAC
CGCTACCTCC TCCCGCACCT CCACGAGGAG GGCGTCGACG AGGAGGACGT GCGGATCAAC
AAGGCGTACT ACCTCTGCCG GATGGCCGAG GCGTGCTTCG AACTCGCCTT GGAGCGCCGC
GAGGCCGACG ACAAGGACCA CTACGCGAAC AAGCGCCTGA AGGTCTCCGG CGACCTGATG
CGCGACCTGT TCCGGACCGC GCTGAACAAG CTGGCACGCG ACGTGAAGTA CCAGCTTGAG
CGCGCGAACA TGCGGAACCG CGATCTCACG GTCAACACGG TTGTCCGCTC CGACGTACTG
ACCGAGCGGC TCGAACACCC GATCGCGACG GGGAACTGGG TGGGTGGTCG CTCCGGCGTC
TCCCAGCTCG TTGACCGGAC GGACTACATG GGTGTGCTCT CGCACCTCCG GCGCCTGCGC
TCGCCGCTGT CGCGGTCGCA GCCGCACTTC AAGGCGCGAG ACCTCCACGC GACCCAGTGG
GGTCGCATCT GTCCCTCCGA GACTCCGGAG GGGCCGAACT GTGGACTCGT GAAGAACTTC
GCGCAGGCGA TGGAGCTCTC ACAAACCGTA GACGACGAAC AGGGGCTGAA ACGAGAACTG
GCGTCGATGG GTGTCGAGGG GATTCCCGGC ATCGAGGGCG TCGACCGACA GACGGCGGAC
GACTAA
 
Protein sequence
MNRQDRRVVS REYFSDERLA EHHFRSFNNF LDRGMQEVVD EKETIETDIG DKEGQEPVYV 
ELGDVRMVTP RVREADGSEE LLYPQEARLR NITYSAPVFM EMSIVRGGED EPEQVVDTTE
TKVGRMPIMV GSNKCNMAGF SDDELIDIGE DPVDPGGYFI VNGSERVLMT SEDLAPNKIL
AEYDSKYGDE IQVAKTFSQR RGYRALVLCE RNREGLLEVS FPSVSGSIDF VTLVRALGLE
SDEEIVHRVS DDPEIVKFML ENLEEAEVQT TEGAIETLGE RVASGQGKNY QLKRANYVID
RYLLPHLHEE GVDEEDVRIN KAYYLCRMAE ACFELALERR EADDKDHYAN KRLKVSGDLM
RDLFRTALNK LARDVKYQLE RANMRNRDLT VNTVVRSDVL TERLEHPIAT GNWVGGRSGV
SQLVDRTDYM GVLSHLRRLR SPLSRSQPHF KARDLHATQW GRICPSETPE GPNCGLVKNF
AQAMELSQTV DDEQGLKREL ASMGVEGIPG IEGVDRQTAD D