Gene Pars_1658 details

Gene Information

Locus tag: Pars_1658
Symbol:
ID: 5054514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1495200
End bp: 1497413
Gene Length: 2214 bp
Protein Length: 737 aa
Translation table: 11
GC content: 57%
IMG OID: 640469201
Product: AAA family ATPase, CDC48 subfamily protein
Protein accession: YP_001153863
Protein GI: 145591861
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0464] ATPases of the AAA+ class
TIGRFAM ID: [TIGR01243] AAA family ATPase, CDC48 subfamily
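
The coordinate and length fields above agree with each other, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the terminal stop codon. Both are assumptions that happen to reproduce the listed values, and the function names are illustrative rather than part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count of a CDS, assuming exactly one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(1495200, 1497413)
print(length, protein_length(length))  # 2214 737
```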


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.814127
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGAGG TTATTCTTAA AGTAGCTGAG GCTAAGTCCC GCGACGTCGG CCGTAGCATT 
GTTAGAATAC CTGTCAGAGT TATGAAAAGG CTTGGTATAG AACCTGGCGA CTACGTGGAG
ATTAGTGGGA GGAAGACGGC GTATGCCCAA GTGTGGCCAG CGTATCCAGA AGACGAGGAT
AAGGAGATTA TTAGGATGGA TGGTATAATA AGGCAAAACG CCGGCGTTGG TATAGGCGAC
ACGGTTAAGG TGAAGAAGGC GGTTCTGAAG CCGGCGCAGA GAGTGGTGCT CGCCCCCACC
GAGCCCGTCA GGGTCGACCC GGAGTACGTC AAGAAGCAGA TTCTGCTAGG CAAGCCGGTG
GCAAGGGGCC AGGCGGTGGA CGTCCCCTTC TACGGCGGCG CCATCCGCTT CGTGGTGGTC
CAGGTACAGC CAGGTCCCGC CGCCTACGTC TCCATCGACA CCGAGGTCAC AGTGAGGGAA
GAGCCAGTCA AGGAGGCCGA GCTGACGATC CCCAGAATCA CTTGGGAGGA TATTGGTGAT
TTGGAGGATG CTAAGCAGAA GATTCGGGAG CTTGTGGAGC TTCCTCTTCG CCACCCGGAG
CTTTTTAAGC ATTTGGGTAT TGAGCCGCCT AAGGGTATTC TGTTAATTGG CCCTCCCGGT
ACTGGGAAGA CGCTTTTGGC AAAAGCCGTT GCCAACGAGG CCAACGCCTA CTTCGTAGCC
ATAAATGGGC CGGAGATTAT GTCTAAGTAC TACGGCGAGA GTGAGGCTAG GCTGAGGGAG
ATATTCGAAG AGGCTAAGAA AAACGCCCCG GCGATAATCT TCATCGACGA AATAGACGCC
ATAGCCCCCA AGAGGGAGGA GGTGACGGGC GAAGTGGAGA AGAGAGTAGT AGCCCAGCTG
TTGACATTAA TGGACGGACT ACAAGAAAGA GGCCAAGTAG TAGTCATAGG AGCCACCAAC
AGACCAGACG CAGTAGACCC AGCACTAAGA AGACCAGGAA GATTCGACAG AGAAATACAC
ATACCCATGC CGGATAAGAG GGCGCGGCGG GAGATATTGG CCGTCCACAC TAGGAATATG
CCGCTGTGCA CTAAGGCCGA TGTGGAAGCC AAGGTGTGCA ACCCCGGCGA CGAGGTTGAC
CTTGACAAAA TTGCAGAGAT GACCCACGGC TACACCGGCG CCGACATAGC GGCTCTCGCT
AAGGAGGCGG CCATGGCGGC GTTGAGAAAG GCGATAAACA AGGGGATGAT CAACATTGAG
CAGGACATAA TCCCCCAGGA GGTGTTGAGC AAGCTGAAGG TAGGCATGTC CGACTTCCTA
GAAGCCATGA AGTTTGTCCA CCCCACCGTG CTCCGCGAGG TCATCATAGA GGTGCCGGAG
GTGCACTGGG ACGACATCGG GGGATACGAC GCAATTAAGC AAGAGCTGAG GGAGATTGTG
GAGTGGCCCA TGAAGTACCG CCACTACTTC GAGGAGCTCG GCGTAGAGCC GCCGAAGGGT
ATACTGCTTT TCGGCCCGCC GGGGGTTGGA AAGACGCTGT TCGCCAAGGC TGTGGCCACG
GAGTCGGGGG CCAACTTCAT AGCCGTTAGG GGGCCGGAGC TCCTCTCCAA GTGGGTTGGC
GAGAGCGAGA AGGCGATACG CGAGGTGTTC AAAAAGGCCC GCATGGCCGC GCCGTGTGTT
ATATTCTTCG ACGAGATAGA CTCGATAGCC CCCGCCAGGG GATCGAGGCT CGGGGACTCC
GGCGTGACCG ACCGCATGGT GAACCAGCTC CTTGCGGAGA TGGACGGCAT TGGGACGTTG
AAAAACGTGG TGGTCATGGC GGCGACGAAT AGGCCAGACA TACTTGACCC CGCCCTGCTG
AGGCCTGGGC GCTTTGACAG GATTATATAC GTGCCGCCGC CCGACATCAA GGCAAGGCTC
GAGATCTTTA AAGTGCACAC AAAGAAGGTT AAGCTGGCCA ACGACGTCAA TTTGGAAGAG
CTGGCGAAGA AGACTGAGGG CTACACAGGC GCCGACATAG CCGCCGTGGT GAGAGAGGCG
GCTATGCTGG CTTTGAGGGA GACGATTAAA GAGAGGAGCG TCGGCGCGAA GCCCGTGTCC
ATGAAACACT TCGAAGAGGC GTTGAAGAGA ATACCGCCGT CGCTTACGCC AGAGGACATG
AGGCGCTACG AAGAAGTCGC TAAGAGACTC AGGCGGGCAA TAGCCGGCTT ATAA
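
A GC content figure like the one in the Gene Information section can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as a string with the 10-base display grouping left in place. The helper name `gc_content` is hypothetical. Only the first display row is used for illustration, so its value need not match the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for display grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First display row of the gene sequence, used here as an illustration only.
first_row = "ATGTCGGAGG TTATTCTTAA AGTAGCTGAG GCTAAGTCCC GCGACGTCGG CCGTAGCATT"
print(f"{gc_content(first_row):.1%}")
```

Passing the full 2214 bp sequence instead of `first_row` gives the value to compare against the listed GC content.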
 
Protein sequence
MSEVILKVAE AKSRDVGRSI VRIPVRVMKR LGIEPGDYVE ISGRKTAYAQ VWPAYPEDED 
KEIIRMDGII RQNAGVGIGD TVKVKKAVLK PAQRVVLAPT EPVRVDPEYV KKQILLGKPV
ARGQAVDVPF YGGAIRFVVV QVQPGPAAYV SIDTEVTVRE EPVKEAELTI PRITWEDIGD
LEDAKQKIRE LVELPLRHPE LFKHLGIEPP KGILLIGPPG TGKTLLAKAV ANEANAYFVA
INGPEIMSKY YGESEARLRE IFEEAKKNAP AIIFIDEIDA IAPKREEVTG EVEKRVVAQL
LTLMDGLQER GQVVVIGATN RPDAVDPALR RPGRFDREIH IPMPDKRARR EILAVHTRNM
PLCTKADVEA KVCNPGDEVD LDKIAEMTHG YTGADIAALA KEAAMAALRK AINKGMINIE
QDIIPQEVLS KLKVGMSDFL EAMKFVHPTV LREVIIEVPE VHWDDIGGYD AIKQELREIV
EWPMKYRHYF EELGVEPPKG ILLFGPPGVG KTLFAKAVAT ESGANFIAVR GPELLSKWVG
ESEKAIREVF KKARMAAPCV IFFDEIDSIA PARGSRLGDS GVTDRMVNQL LAEMDGIGTL
KNVVVMAATN RPDILDPALL RPGRFDRIIY VPPPDIKARL EIFKVHTKKV KLANDVNLEE
LAKKTEGYTG ADIAAVVREA AMLALRETIK ERSVGAKPVS MKHFEEALKR IPPSLTPEDM
RRYEEVAKRL RRAIAGL
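
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand. Translation table 11 assigns the same residues to codons as the standard code and differs in which start codons it permits. This gene begins with ATG, so alternative starts are not handled here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First display row of the gene sequence (60 bases, 20 codons).
first_row = "ATGTCGGAGG TTATTCTTAA AGTAGCTGAG GCTAAGTCCC GCGACGTCGG CCGTAGCATT"
print(translate(first_row))  # MSEVILKVAEAKSRDVGRSI
```

The output matches the first 20 residues of the protein sequence above. Translating the full gene sequence the same way should reproduce all 737 residues.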