Gene Information |
Locus tag | Pars_1704 |
Symbol | |
ID | 5054516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1538195 |
End bp | 1539871 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640469247 |
Product | thermosome |
Protein accession | YP_001153907 |
Protein GI | 145591905 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02339] thermosome, various subunits, archaeal |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.52837 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCATTATT TATGTCAAGC CATGGCACAG CAAGCACCAA AGTCAGGAGT TCCGGTAATG ATACTAAAGG AGGGTTCCCA GCGTACCACC GGCGTTGACG CCCGGCGCTC TAACATACAG GCTGCTAAGG TAATCGCGGA GATACTGGCG ACATCCCTAG GTCCTCGCGG AATGGACAAG ATGCTCATCG ACGCCTTTGG GGACGTCACG ATTACTGGTG ATGGCGCTAC GATTCTCAAG GAGATGGAAG TCCAGCACCC CGCTGCCAAG CTGTTGATCG AAGTAGCGAA GGCCCAAGAC GCCGAGGTCG GCGACGGTAC CACGACAGTC GTGGTCCTCG CAGGCAAGCT CCTTGAGCTC GGCGAAGAGC TCCTCGAGGA GGGGATCCAC CCGACCATTG TGGTAGACGG CTACAAGAAG GCCTCCGACT ATGCTCTGAA GGTGGCCGAG GAGGTCGCCA AGCCCATTGA ACTTACCAAG GAGCAGTTGC TGAAGGTTGT GTCCAGCGCC CTTTCCTCTA AGGTAGTAGC TGAGACTAGG GACTACCTCG CCGGTCTCGT CGTCGAGGCG GCGATGCAGG CAGTGGAACA GAGGGACGGC AAGCCGTATC TAGACCTAGA CTGGATTAAG ATCGAGAAGA AGAAGGGCAA GTCCATCTAC GAGACCCAGC TGATTAGGGG CATTGTGCTG GACAAGGAGG TGGTGCACCC CGGCATGCCG AAGCGCGTCA CCAATGCCAA AATCGCCATT CTAGACGCGC CTCTGGAGAT CGAGAAGCCC GAGTGGACGA CGAAGATAAG CGTGACCAGC CCCGACCAGA TCAAGGCCTT CCTCGACCAG GAGGCGGAGA TCCTCAAGTC GTACGTGGAA CACTTGGCCT CCATCGGCGC CAACGTGGTA ATTACGCAGA AGGGCATCGA CGAGGTGGCC CAGCACTTCT TGGCGAAGAA GGGCATACTG GCGGTTAGGA GAGTGAAGAG GAGCGACATC GAGAAACTGG CGAGGGCTAC AGGCGCCAAG ATAATTACGT CCATTAAGGA CGCCAGACCT GAGGACCTCG GCACAGCTGG CCTCGTCGAA GAGAGGAAGG TGGGCGAAGA GAAAATGGTG TTTGTAGAGG ACATCCCCAA CCCGAGGGCC GTCACCATCC TGGTGAGGGG CGGCAGCGAC CGCATACTAG ACGAGGTCGA GCGCTCTCTG CAAGACGCCC TCCACGTGGC CCGCGACCTG TTCAGAGAGC CTAAGATCGT GCCCGGCGGC GGCGCCTTCG AGGTAGAGGT GGCAAGGAGA GTGAGGGAGT ACGCAAGGAA GCTACCAGGC AAGGAGCAAC TCGCGGCGCT GAAATTCGCC GACGCCCTTG AGCACATCCC CACCATACTG GCGCTGACGG CGGGCCTTGA CCCCGTAGAC GCAATCGCCG AGCTGAGGAG GAGGCACGAC AACGGCGAGC TCACCGCCGG CGTAGACGTC CACGGCGGCA AGATCACCGA CATGGCCGCC CTCAACGTGT GGGATCCGCT AATTGTGAAG AAGCAGGTAA TCAAATCGGC GGTGGAGGCC GCGATAATGA TACTACGCAT CGATGACATA ATCGCAGCGG GAGCGCCGAA GAAAGAGGAG AAGAAAGGCA AGAAAGAGGA GGGCGAAGAA GAGAAGGGCG AGACCAAGTT TGACTAA
Protein sequence | MHYLCQAMAQ QAPKSGVPVM ILKEGSQRTT GVDARRSNIQ AAKVIAEILA TSLGPRGMDK MLIDAFGDVT ITGDGATILK EMEVQHPAAK LLIEVAKAQD AEVGDGTTTV VVLAGKLLEL GEELLEEGIH PTIVVDGYKK ASDYALKVAE EVAKPIELTK EQLLKVVSSA LSSKVVAETR DYLAGLVVEA AMQAVEQRDG KPYLDLDWIK IEKKKGKSIY ETQLIRGIVL DKEVVHPGMP KRVTNAKIAI LDAPLEIEKP EWTTKISVTS PDQIKAFLDQ EAEILKSYVE HLASIGANVV ITQKGIDEVA QHFLAKKGIL AVRRVKRSDI EKLARATGAK IITSIKDARP EDLGTAGLVE ERKVGEEKMV FVEDIPNPRA VTILVRGGSD RILDEVERSL QDALHVARDL FREPKIVPGG GAFEVEVARR VREYARKLPG KEQLAALKFA DALEHIPTIL ALTAGLDPVD AIAELRRRHD NGELTAGVDV HGGKITDMAA LNVWDPLIVK KQVIKSAVEA AIMILRIDDI IAAGAPKKEE KKGKKEEGEE EKGETKFD
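The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the database itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the stop codon. The helper names (`gene_length`, `protein_length`, `ungroup`) are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def ungroup(sequence: str) -> str:
    """Remove the spaces that split the displayed sequence into blocks of 10."""
    return "".join(sequence.split())


if __name__ == "__main__":
    # Values copied from the record above.
    length = gene_length(1538195, 1539871)
    print(length, protein_length(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1677 bp) and Protein Length (558 aa). `ungroup` can be applied to the Gene sequence or Protein sequence field before passing it to other tools.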