Gene Hlac_0056 details


Gene Information

Locus tag: Hlac_0056
Symbol:
ID: 7401411
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 56851
End bp: 58905
Gene Length: 2055 bp
Protein Length: 684 aa
Translation table: 11
GC content: 66%
IMG OID: 643707117
Product: DEAD/DEAH box helicase domain protein
Protein accession: YP_002564732
Protein GI: 222478495
COG category: [R] General function prediction only
COG ID: [COG1202] Superfamily II helicase, archaea-specific
TIGRFAM ID:
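
The start and end coordinates, gene length, and protein length listed above agree with each other. The short sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 56851
end_bp = 58905

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 2055 684
```

Under those assumptions the result matches the listed Gene Length (2055 bp) and Protein Length (684 aa).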


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000344856
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGTCACAGC AGTTGGCCGA AGTCGAGACG CTGTTCCTCC ACGAGGCGCG GAGCGACTAC 
ACCGTTGTCG CCACCCGCGA CGGCAATCGG ATACTCCGCG GGCGGCTCGA ACTCAAGGAG
ACCTCCGCAG GCCCGCGACC GGGCAAGTTC CGTGTCATCC GCGACGGCGA GGACCATCCC
CGCCAGCCCG GCGAATTCGT TGATCTCGCC CGCGCGGCCG GTCGGATTCG GATCTCCGAG
CAGACCTCGC CGAAGAACCG CAAGCGGCTG CAGGCGATGC TCGACGGCTA CCAGCTGGAG
GCGATGGCGG TCCGGACCTG TCGGCGTTGC GCGAACGACG GTCGGTATGG CCCCATCACG
AGCGACGGCG CAATCGAGCA CAACGACGAG CTGATCTGTC GCGACTGCGC CCGCCGGGAG
TTGGAACGCG AGCTGTCGTA CAAAGGCGAG TTCACGGGCG CCGCCGAGGA GCGGCTAGAG
GAGCTGCTGT ACGAGTCCGG GGATTTAGAC CGGATCGTCA ACCTCCTGCA AGGCGGACTC
GATCCCGATC TCACCAAGTA CGACGAGGTC TCCGCGAACG TCGACGATAT CTCCCCGGTC
CGGACCGAGG ACCTCGATCT CCATCCGGAT CTCTCTGCAC ATCTGCAGGG CCGCTTCGAG
GAGCTGCTCC CGGTCCAGAG CCTGTCCGTG CGGAACGGTC TCCTCGATGG TACCGACCAG
CTCGTCGTGA GCGCGACCGC GACCGGGAAG ACCCTCGTCG GCGAGCTAGC CGGAATCGAC
CGGGCACTGA AAGGGGACGG GAAGCTCCTC TTTTTAGTCC CTCTCGTCGC GCTCGCCAAC
CAGAAGCACG AGGACTTCAA AGACCGCTAC GGCGACCGGC TGAACGTCTC GATCCGCGTC
GGCTCCTCGC GGGTGAACGA CGACGGAAAC CGCTTCGACC CGAACGCCGA CGTGATCGTC
GGTACCTACG AGGGGATCGA CCACGCACTC CGGACCGGGA AGGACCTCGG TGACATCGGC
ACCGTCGTCA TCGACGAGGT CCACACGCTG AAGGAGGGCG AGCGCGGCCA CCGGCTCGAC
GGGCTCATCT CACGGCTGAA GTACTACAGC GAGAATCGGA TGGAGACCCA CGAGGGGTAC
GGCGGCACCC AGTTCGTCTA CCTCTCGGCG ACGGTGGGGA ACCCAGAATG GCTCGCCGAG
AAGCTCCGAG CCACGCTCAT CGAGTACGAG GAGCGACCCG TCCCCATCGA GCGCCACGTC
ACCTTCGCGG ACAGCCGCGA GAAGGCGCAG ATCGCCGACA AGCTCGTGAA GCGCGAGTTC
GACACGAAAT CCTCGAAGGG GTACCGGGGC CAGACGATCA TCTTCACGAA CTCCCGGCGA
CGATGTCACG AGATCAGCCG GAAGCTCCGG TACGACTCCG CGCCGTATCA CGCCGGACTC
GACTACAAGC GCCGGAAGAA AGTCGAACGG CAGTTCGGGA ACCAGGACCT CTCGGCGGTC
GTCACCACCG CGGCGCTGGC GGCCGGCGTC GACTTCCCCG CCTCGCAGGT GATCTTCGAC
TCGCTGGCGA TGGGGATCGA GTGGCTCTCC GTTCAGGAGT TCTCCCAGAT GCTCGGGCGC
GCCGGGCGGC CGGACTACCA CGACCGTGGC CGTGTCTACC TCCTCGTCGA GCCCGACGGC
GTTTACCACA ACTCCATGGA CCGGACCGAA GATGAGGTGG CGTTCACCCT CCTCAAGGGG
GAGATGGAGG ACGTGGCGAC CCACTACGAC GAGACCGCCG CCGTCGAGGA GACGCTGGCG
AACGTCGTCG TCGCGGGCAA GAAGGCCAAG CGGCTCAACG ATCGGATGAT CGGCGAGGTG
CCGACGAAAC ACGCGGTCGG AAAGTTACTG GAGTGGAAGT TCATCGACGG CTTCTCGCCG
ACGCCGCTCG GTCGCGGTAT CACGAGGCAC TTCCTCGCGC CCGACGAGGC GTTCTTCATG
CTCGACGCGA TCCGGAAGGG GACGGACCCG TACCAGATCG TCGCCGACCT CGAACTGCGC
GACGACGAGG AGTGA
 
Protein sequence
MSQQLAEVET LFLHEARSDY TVVATRDGNR ILRGRLELKE TSAGPRPGKF RVIRDGEDHP 
RQPGEFVDLA RAAGRIRISE QTSPKNRKRL QAMLDGYQLE AMAVRTCRRC ANDGRYGPIT
SDGAIEHNDE LICRDCARRE LERELSYKGE FTGAAEERLE ELLYESGDLD RIVNLLQGGL
DPDLTKYDEV SANVDDISPV RTEDLDLHPD LSAHLQGRFE ELLPVQSLSV RNGLLDGTDQ
LVVSATATGK TLVGELAGID RALKGDGKLL FLVPLVALAN QKHEDFKDRY GDRLNVSIRV
GSSRVNDDGN RFDPNADVIV GTYEGIDHAL RTGKDLGDIG TVVIDEVHTL KEGERGHRLD
GLISRLKYYS ENRMETHEGY GGTQFVYLSA TVGNPEWLAE KLRATLIEYE ERPVPIERHV
TFADSREKAQ IADKLVKREF DTKSSKGYRG QTIIFTNSRR RCHEISRKLR YDSAPYHAGL
DYKRRKKVER QFGNQDLSAV VTTAALAAGV DFPASQVIFD SLAMGIEWLS VQEFSQMLGR
AGRPDYHDRG RVYLLVEPDG VYHNSMDRTE DEVAFTLLKG EMEDVATHYD ETAAVEETLA
NVVVAGKKAK RLNDRMIGEV PTKHAVGKLL EWKFIDGFSP TPLGRGITRH FLAPDEAFFM
LDAIRKGTDP YQIVADLELR DDEE
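
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is one of the permitted initiation codons, and an initiation codon is read as methionine. The sketch below is a minimal illustration of that relationship and is not the pipeline used to produce this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon table is written out by hand in the usual TCAG order, and the whole gene sequence can be pasted in as a string, since spaces and line breaks are ignored.

```python
# Amino acid assignments for the 64 codons, ordered with bases T, C, A, G.
# Translation table 11 uses the same assignments as the standard code and
# differs only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a CDS, reading an initiation codon at position 0 as M."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("GTGTCACAGC AGTTGGCCGA AGTCGAGACG"))  # MSQQLAEVET
```

The printed peptide matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the listed GC content.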