Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1204 |
Symbol | |
ID | 5054328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1090341 |
End bp | 1092326 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640468751 |
Product | hypothetical protein |
Protein accession | YP_001153424 |
Protein GI | 145591422 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1331] Highly conserved protein containing a thioredoxin domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.139299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAGAG AGGCGTGTTT GAGGAGGTCT ACGTCGCCCT TTGTGTTGGA TGGGCTGAGG AGCAAGGTGC AGTGGTGGGC GTGGTGCGAG GAGGCTTTCC AGAAGGCGAA GGCGGAGGAC AAGCCTATAT TAGTGGACGT GGGCGCCGTC TGGTGTCATT GGTGCCACGT TATTGACGAG ACGACGTACA ACGATGACGA GATTGCCGAT ATCATAAACA AGCATTTCGT GCCGATTAAG GTAGATCGGG ACGAGAGGCC AGACGTAGAC CGCCGACTGC AAGAATATGC GGTTTTAGTT AGCGGCCAGT CTGGTTGGCC TCTCACCGTC TTCATGACGC CGGAGGGGGA GGTGATCTGG GCCGCCACGT ATCTTCCGCC GAGGGACTAC GGCGGCTTGC CGGGGATGGC CAAAGTGCTG AGGGCTGTTC TGGAGGCGTA CAGGACGAAG AAAGGCGATA TCAAGAAGAT GGCGGAGGAT CTCTCTAAGG AGATAGCGGC GTGGCACAAC CCTTCGGAGG CCGAGCTTGA CCGCTCCGTC CAACTTGACA TACTGGCGTC GCTGGCCGCC TCCTTCGACG AGGAGTACGG CGGCTTCGGC ACGGCGCCTA AGTTCCCGCC GATCACCCAG CTGGATCTGT TGTTGTTGCG GCATTTCTAC GACGGGAAGT CGGTCTACGG TAAGATGGCC CATGCGACCT TGAGGGCCAT GGCGCGAGGA GGGGTCTACG ACCAGCTTGG TGGCGGCTTC TTCCGCTACT CCACTGACCG CTTGTGGCTT ATCCCCCACT ACGAGAAGCT CCTAGTAGAC AACGCAGAGC TGTTGTCGCT CTACGCCAGG GCATATGCCC ACTTCGGCGA CCAGCTGTAT AGAAAAACGG CGGCGGGGAT CATCAAGTGG CTCGACGAAT TCATGCGCGA CCCGGGCGGC GGATACTACG CCAGCCAAGA CGCCGACGTA GACGGGGAGG AGGGCGCCTA CTACCGCTGG ACGGAGGACG AGCTTAAGGA GACCCTGGGC GATCTCTTCC CCAAAGCGGC TGATATGTTT GGCCTATACG AATTTAAGTG GCCCGAGGGG CGGGCTACCC TAAGCATAGT TAGGGTTGTG CCGGAAGCCG ACTTGATCCT TGAGAGGCTG GCGGAGGCCC GCAAGGCGAG GAAGCCACCG AGGGTGGACA CCACGATTTA CGCCGGTTGG AGTTGCGCCA TGGCTAAGGC GGAGCTGGAG GCGAGCCGCC TGGCGGGGAT AGGGGACAAG GAGTTCGCCT TGAAGACTCT TGACAAGATC AGGAGGGAGG CGTGGGACGG CTCGAGGCTG GCCCGCGGGC TTAGGGGCGG GGGGCCCGTG GGGGAGGGAG TTCTGGAGGA CTACGCCTAC TGCGCCTTAG CCGCGCTGGA GGCCTACTCC CACACCGGCA GATACCTGGA CTGGGGCGTA GAGGTGGCGG GGGCGATGGT GGATAGGTTC CTAGACCAGG GAGGGTTTAG AGATGTGGAG AGGCCAGACC CCGTGTTGAA GACGCCGCAC TACCCCGTGG CTGATACACC CAACTACTCG GGGAACGCCC TGGCCATATT GGCGTGCGAC CTTCTGCACT ACGCCACGGG TATCCGCAAG TTTAGAGACG CGGCGGAGAG GGCTCTGAAG GCGCTGGCGG GCAAGCTGGC GAGGCTAGGG CCCTCCGCCG CCGGGTTGGC CATCGCCCTG GACGCCCACT TGGCCGAGCC TCCCCGGACG GTGGTTGTGG GCTCTGCCGA GGAGCTTCTG AGGGCGGCCC TTGCGGCGTA CCGCCCCCTG CACGTGGTGA TGCCTGTGGC AAGCGGCTGG GACTACCCCG AGCCCTCCAT AAAGGCCATG CTGGCGGGGC CGAAGCCGGC GGCCTACGTC TGCGCTTGGG GGGCTTGCTC CATGCCGATA TCCGACCCGG GGAGGCTGGG GGAGGCCGTT AGGAAATTTA GGAGGGAGGC CTACGGGCTA GAATAG
|
Protein sequence | MDREACLRRS TSPFVLDGLR SKVQWWAWCE EAFQKAKAED KPILVDVGAV WCHWCHVIDE TTYNDDEIAD IINKHFVPIK VDRDERPDVD RRLQEYAVLV SGQSGWPLTV FMTPEGEVIW AATYLPPRDY GGLPGMAKVL RAVLEAYRTK KGDIKKMAED LSKEIAAWHN PSEAELDRSV QLDILASLAA SFDEEYGGFG TAPKFPPITQ LDLLLLRHFY DGKSVYGKMA HATLRAMARG GVYDQLGGGF FRYSTDRLWL IPHYEKLLVD NAELLSLYAR AYAHFGDQLY RKTAAGIIKW LDEFMRDPGG GYYASQDADV DGEEGAYYRW TEDELKETLG DLFPKAADMF GLYEFKWPEG RATLSIVRVV PEADLILERL AEARKARKPP RVDTTIYAGW SCAMAKAELE ASRLAGIGDK EFALKTLDKI RREAWDGSRL ARGLRGGGPV GEGVLEDYAY CALAALEAYS HTGRYLDWGV EVAGAMVDRF LDQGGFRDVE RPDPVLKTPH YPVADTPNYS GNALAILACD LLHYATGIRK FRDAAERALK ALAGKLARLG PSAAGLAIAL DAHLAEPPRT VVVGSAEELL RAALAAYRPL HVVMPVASGW DYPEPSIKAM LAGPKPAAYV CAWGACSMPI SDPGRLGEAV RKFRREAYGL E
|
| |