Gene Pars_1704 details

Gene Information

Locus tag: Pars_1704
Symbol:
ID: 5054516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1538195
End bp: 1539871
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 59%
IMG OID: 640469247
Product: thermosome
Protein accession: YP_001153907
Protein GI: 145591905
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02339] thermosome, various subunits, archaeal
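
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced, and that the gene length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1538195, 1539871)
print(length)                     # 1677
print(protein_length_aa(length))  # 558
```

Under these assumptions the computed values agree with the listed Gene Length (1677 bp) and Protein Length (558 aa).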


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.52837
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCATTATT TATGTCAAGC CATGGCACAG CAAGCACCAA AGTCAGGAGT TCCGGTAATG 
ATACTAAAGG AGGGTTCCCA GCGTACCACC GGCGTTGACG CCCGGCGCTC TAACATACAG
GCTGCTAAGG TAATCGCGGA GATACTGGCG ACATCCCTAG GTCCTCGCGG AATGGACAAG
ATGCTCATCG ACGCCTTTGG GGACGTCACG ATTACTGGTG ATGGCGCTAC GATTCTCAAG
GAGATGGAAG TCCAGCACCC CGCTGCCAAG CTGTTGATCG AAGTAGCGAA GGCCCAAGAC
GCCGAGGTCG GCGACGGTAC CACGACAGTC GTGGTCCTCG CAGGCAAGCT CCTTGAGCTC
GGCGAAGAGC TCCTCGAGGA GGGGATCCAC CCGACCATTG TGGTAGACGG CTACAAGAAG
GCCTCCGACT ATGCTCTGAA GGTGGCCGAG GAGGTCGCCA AGCCCATTGA ACTTACCAAG
GAGCAGTTGC TGAAGGTTGT GTCCAGCGCC CTTTCCTCTA AGGTAGTAGC TGAGACTAGG
GACTACCTCG CCGGTCTCGT CGTCGAGGCG GCGATGCAGG CAGTGGAACA GAGGGACGGC
AAGCCGTATC TAGACCTAGA CTGGATTAAG ATCGAGAAGA AGAAGGGCAA GTCCATCTAC
GAGACCCAGC TGATTAGGGG CATTGTGCTG GACAAGGAGG TGGTGCACCC CGGCATGCCG
AAGCGCGTCA CCAATGCCAA AATCGCCATT CTAGACGCGC CTCTGGAGAT CGAGAAGCCC
GAGTGGACGA CGAAGATAAG CGTGACCAGC CCCGACCAGA TCAAGGCCTT CCTCGACCAG
GAGGCGGAGA TCCTCAAGTC GTACGTGGAA CACTTGGCCT CCATCGGCGC CAACGTGGTA
ATTACGCAGA AGGGCATCGA CGAGGTGGCC CAGCACTTCT TGGCGAAGAA GGGCATACTG
GCGGTTAGGA GAGTGAAGAG GAGCGACATC GAGAAACTGG CGAGGGCTAC AGGCGCCAAG
ATAATTACGT CCATTAAGGA CGCCAGACCT GAGGACCTCG GCACAGCTGG CCTCGTCGAA
GAGAGGAAGG TGGGCGAAGA GAAAATGGTG TTTGTAGAGG ACATCCCCAA CCCGAGGGCC
GTCACCATCC TGGTGAGGGG CGGCAGCGAC CGCATACTAG ACGAGGTCGA GCGCTCTCTG
CAAGACGCCC TCCACGTGGC CCGCGACCTG TTCAGAGAGC CTAAGATCGT GCCCGGCGGC
GGCGCCTTCG AGGTAGAGGT GGCAAGGAGA GTGAGGGAGT ACGCAAGGAA GCTACCAGGC
AAGGAGCAAC TCGCGGCGCT GAAATTCGCC GACGCCCTTG AGCACATCCC CACCATACTG
GCGCTGACGG CGGGCCTTGA CCCCGTAGAC GCAATCGCCG AGCTGAGGAG GAGGCACGAC
AACGGCGAGC TCACCGCCGG CGTAGACGTC CACGGCGGCA AGATCACCGA CATGGCCGCC
CTCAACGTGT GGGATCCGCT AATTGTGAAG AAGCAGGTAA TCAAATCGGC GGTGGAGGCC
GCGATAATGA TACTACGCAT CGATGACATA ATCGCAGCGG GAGCGCCGAA GAAAGAGGAG
AAGAAAGGCA AGAAAGAGGA GGGCGAAGAA GAGAAGGGCG AGACCAAGTT TGACTAA
 
Protein sequence
MHYLCQAMAQ QAPKSGVPVM ILKEGSQRTT GVDARRSNIQ AAKVIAEILA TSLGPRGMDK 
MLIDAFGDVT ITGDGATILK EMEVQHPAAK LLIEVAKAQD AEVGDGTTTV VVLAGKLLEL
GEELLEEGIH PTIVVDGYKK ASDYALKVAE EVAKPIELTK EQLLKVVSSA LSSKVVAETR
DYLAGLVVEA AMQAVEQRDG KPYLDLDWIK IEKKKGKSIY ETQLIRGIVL DKEVVHPGMP
KRVTNAKIAI LDAPLEIEKP EWTTKISVTS PDQIKAFLDQ EAEILKSYVE HLASIGANVV
ITQKGIDEVA QHFLAKKGIL AVRRVKRSDI EKLARATGAK IITSIKDARP EDLGTAGLVE
ERKVGEEKMV FVEDIPNPRA VTILVRGGSD RILDEVERSL QDALHVARDL FREPKIVPGG
GAFEVEVARR VREYARKLPG KEQLAALKFA DALEHIPTIL ALTAGLDPVD AIAELRRRHD
NGELTAGVDV HGGKITDMAA LNVWDPLIVK KQVIKSAVEA AIMILRIDDI IAAGAPKKEE
KKGKKEEGEE EKGETKFD
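
The sequences in this record are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch, assuming plain A/C/G/T input, of how the gene sequence could be cleaned and its length and GC fraction computed for comparison with the Gene Length and GC content fields. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGCATTATT TATGTCAAGC CATGGCACAG CAAGCACCAA AGTCAGGAGT TCCGGTAATG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of sample: {gc_fraction(seq):.2f}")
```

Passing the full gene sequence text to `clean_sequence` instead of the sample line gives the values to compare against the record's fields.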