Gene Hlac_2069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2069 
Symbol 
ID7400589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2056804 
End bp2059134 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content72% 
IMG OID643709140 
ProductAAA ATPase central domain protein 
Protein accessionYP_002566717 
Protein GI222480480 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0464] ATPases of the AAA+ class 
TIGRFAM ID[TIGR01243] AAA family ATPase, CDC48 subfamily 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.963415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.166628 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGTA AGGGTAAAGG GCGAGTGGCC GGAACGGAGC GTATGAACGA CACCGCCGGC 
CTCCGGCTCA CGGTGCGTGC CGCCGAGAAG CGGGACGCGG GCCGGGGCAT CGCCCGGCTC
CCCGAATCCT CCCGAAAGCG GCTCGGCCTC CTCAGCGGCG ACACCGTCGA GATCCGCGGT
GAGCGCACCG CGGTCGCGAA GGTGTGGCCC GGCGGCGCGG ACGCCCCAGA AGGATCCGTC
CTCATCGACG CGGACACTCG CGCGAACGCC GGCGTGAAGG TCGGCGACGC CGTCACCGTC
GCCGCCGTCG ACGTCGAGGA CGCCGACCGC ATCGTGCTCG CCGCGCCCGC GGAGCTCGCC
GACGTGGACG TGAGCCGCGA GGTGGTCGAG CGCGCGCTCG CCCGCGACCT CCGCGACCGT
CCCGTCACCG AGGGCGAGGC GGTCCACGTT GAGCGGCTCG GCGGGATTCG ATTCGTCGTC
GCCGAGACGG ACCCGGCGGG CACGGTGCGC GTCACGAGCC GGACCGACGT GTCGGTGAGA
TACGGGGACG AGGACGCGTC GCGCTCCGAC GCGGGCGCGG GGCGGGATCG CCGGACTGCC
TCCGGATCTT CGACGGGGAG CGGTCCCTCG TCCGACAGAT CCATCGGCAC CGACACCGCC
AGCACCGACA CCGCCGGCAC CGACACCTCC GGAACCGACG CCCGCCCGCC CGGCGGCACC
GAACCCCCGG CGGAACACAC CGCGGGCGCG ACCTACGAGG ACATCGGCGG GCTCGACGAG
GAACTGGAGT TAGTCCGCGA GACGATCGAG CTCCCGCTCT CGGAGCCCGG CGTGTTCACC
CGGCTCGGGA TCGACCCGCC GAAGGGCGTC CTGCTCCACG GCCCGCCGGG GACCGGGAAG
ACGTTGATCG CCCGCGCCGT CGCCAACGAG GTCGACGCGA CGTTCATCAC CGTCGACGGC
CCGGAGATCA TGTCGAAGTA CAAAGGCGAG TCGGAGGAGC GCTTACGGGA CGTGTTCGAG
CGGGCCAGCG AGGAGGCCCC CGCGATCATC TTCTTCGACG AGATCGACTC GATCGCGGGC
AAGCGCGACG ACGGCGGCGA CGTGGAGAAC CGCGTCGTCG GCCAGTTGCT CTCGCTGATG
GACGGGCTCG ACGCCCGCGG CGACGTGATC GTCATCGGCG CGACCAATCG CGTCGACACC
CTCGACCCCG CCCTCAGACG GGGCGGTCGG TTCGACCGCG AGATCGAGAT CGGGGTTCCC
GGTGAGGCCG GCCGTCGCCA GATCCTCGAC GTCCACACGC GTCGGATGCC CCTCGCGGAC
GACGTGGACT TGGACCGCAT CGCGGCCCGG ACCCACGGGT TCGTCGGGGC CGACATCGAG
GGGCTCACGC AGGAGGCGGC GATGACCGCC CTGCGGCGCG CCCGCGAATC GGACGCTGCG
GCACTCGACG ACGTGACGGT CGGGAAGGCG GACTTCGAGG CCGCCCACGC CGCCGTCGAA
CCGAGTGCGA TGCGCGAGTA CGTCGCCGAG CAGCCGACCA CCGACTTTAC GGATGTCGGC
GGGCTCCCGG AAGCGAAGGA GAAGCTAGAA CGAGCCGTCA CGTGGCCGTT GACGTACGGC
CCCCTCTTCG AGGCCGCCGA CGCCGACCCG CCGACGGGGA TCCTGCTCCA CGGACCGCCC
GGAACGGGAA AGACTCTCCT CGCTCGGGGG ATCGCGGGCG AGAGCGGCGT GAACTTCATT
CAGGTGGCCG GCCCGGAGCT GCTCGACCGG TACGTCGGCG AGTCCGAGAA GGCGGTCCGT
GACCTGTTCG ATCGCGCGCG GCAGGCGGCG CCTGTGATCA TTTTCTTCGA CGAGATCGAC
GCGATCGCCG CCGATCGCGA CGCCGCTGGC GGGGACAGTT CGGGCGTCGG CGAGCGGGTG
GTCTCCCAGC TGCTCACGGA ACTCGACCGC GCGAGCGACA ACCCGAACCT CGTCGTGCTC
GCGGCGACGA ACCGGCGGAA CGCACTCGAT CCAGCGCTGC TCCGGCCCGG ACGGTTAGAG
ACGCACATCG AGGTGCCCGA GCCCGACCGC GAGGCGCGCC GGAAGATCCT CGACGTACAC
ACCCGCACGA AGCCCCTCGT CGAGGGCGTC GACTTAGAGC ACCTCGCCGA CGAGACCGAG
GGGTACTCCG GCGCTGAGAT CGCCTCGCTG TGCCGGGAGG CCGCCCTGAT CGCCATCGAG
CGCGTCGCGG ACGAGCACGG TGAGGCCGCC AACGATCACG CCGACGAGGT CGGGATCACC
GCTGACGACT TCGCGGCGGC GCTGGAAACC GTCCGCCCGG CGACGCCGTA A
 
Protein sequence
MAGKGKGRVA GTERMNDTAG LRLTVRAAEK RDAGRGIARL PESSRKRLGL LSGDTVEIRG 
ERTAVAKVWP GGADAPEGSV LIDADTRANA GVKVGDAVTV AAVDVEDADR IVLAAPAELA
DVDVSREVVE RALARDLRDR PVTEGEAVHV ERLGGIRFVV AETDPAGTVR VTSRTDVSVR
YGDEDASRSD AGAGRDRRTA SGSSTGSGPS SDRSIGTDTA STDTAGTDTS GTDARPPGGT
EPPAEHTAGA TYEDIGGLDE ELELVRETIE LPLSEPGVFT RLGIDPPKGV LLHGPPGTGK
TLIARAVANE VDATFITVDG PEIMSKYKGE SEERLRDVFE RASEEAPAII FFDEIDSIAG
KRDDGGDVEN RVVGQLLSLM DGLDARGDVI VIGATNRVDT LDPALRRGGR FDREIEIGVP
GEAGRRQILD VHTRRMPLAD DVDLDRIAAR THGFVGADIE GLTQEAAMTA LRRARESDAA
ALDDVTVGKA DFEAAHAAVE PSAMREYVAE QPTTDFTDVG GLPEAKEKLE RAVTWPLTYG
PLFEAADADP PTGILLHGPP GTGKTLLARG IAGESGVNFI QVAGPELLDR YVGESEKAVR
DLFDRARQAA PVIIFFDEID AIAADRDAAG GDSSGVGERV VSQLLTELDR ASDNPNLVVL
AATNRRNALD PALLRPGRLE THIEVPEPDR EARRKILDVH TRTKPLVEGV DLEHLADETE
GYSGAEIASL CREAALIAIE RVADEHGEAA NDHADEVGIT ADDFAAALET VRPATP