Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0978 |
Symbol | |
ID | 5054230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 868423 |
End bp | 869610 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640468534 |
Product | hypothetical protein |
Protein accession | YP_001153210 |
Protein GI | 145591208 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1651] Protein-disulfide isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00000000566391 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTACCTCC TACAGTCTCA ACCACCTGCA AATAAGACCT CCACACAGCC TGTTCAAACG ACATCCGGCA AGCTTTACGT ATACACGGCA CGACTCGAAG GGGGACAAGC CTTGGAGCTA CTAACTTACT ACATACCTTT GAGCAGCGGC ACCGTCTACG CCCAGATCTA CAACAACTCA GCCTCCTACT TCATGGTGAA AAACACGGGT ATAATCAACG TCCTCTCGCA GACGCTAAAC AGCTACCAGA AAGCCACCTA CTACAGCAAA TTGCTACAGA TATGCGTGAA CTCGACTACA ACCACGACTG TGGCGGGCGA GCGCATAACC CTCTCCAATA GCCAATGCCA GCAGTCCACG TCTCCCCTCC CCACGGCTAA GACATTCGAC GAGCTGGTCC TACTTATCCA AGGACTTCCC GGGCCCACGT CCCCGAGCCA GTGGAAGCAG TCAGGCACAG CCCAAACCCC CTACGGCCAA GCCACAGTAT ACACAAACAC CACCGAGGTA CCGATAATGC AAGGCCTATC GGCCATCCTA GATTATGAGA AACAACAACT CGCAGATGGG ACTATATACC AATTTAAGGT CAAGCTATCC TACGGCGGGC AGGTAGCCGC CATTTTGACG TACACCCTTC GGAACATCAC GACAATACCG GCCGACGTCG CTAATGTGCT GAACGAGCTT TCTAGAGACG TCGTCGGGAC AAACGGCGGC GGCCTTGACA TCCTCAGAGT CGCAGAGAAG ATAGGCATGA AATACGACGG CAAATGGCCA GCCGCAATCG TCTTCTTCGA CTTACAGTGC CCCTACTGTG CCCAGTTGTT TAAATACAAC TACACCCTAT TCGAGGGACA CAGACTAGTC TTGGTAGATC TAATAGTCCA CCCAGACGCA TTGCCCGCCC ACGAGAGGTT GAGATGCCTC TACAACAGCA CGCCGCAAGA GGTAATCCCA ACGCTGAGGG TAGTATACGA CCGCTTCCTA GCCGGAGACC AAAACTACAC TAGTATACTC CCGCAAAGTA GTTGTCCAAT AGATGCAAAT GCTGGAATGC AACTAGCTAC TCTCATTGCT GGGCAGAATG TGGGGACGCC GCTGGTAGTT GTGGTCTACC CCAACGGGAC CTACACCTAC GTTGTGGGAT ACGACCCAGC TAGTATTGCA AAGGCACTTA AGGGGTGA
|
Protein sequence | MYLLQSQPPA NKTSTQPVQT TSGKLYVYTA RLEGGQALEL LTYYIPLSSG TVYAQIYNNS ASYFMVKNTG IINVLSQTLN SYQKATYYSK LLQICVNSTT TTTVAGERIT LSNSQCQQST SPLPTAKTFD ELVLLIQGLP GPTSPSQWKQ SGTAQTPYGQ ATVYTNTTEV PIMQGLSAIL DYEKQQLADG TIYQFKVKLS YGGQVAAILT YTLRNITTIP ADVANVLNEL SRDVVGTNGG GLDILRVAEK IGMKYDGKWP AAIVFFDLQC PYCAQLFKYN YTLFEGHRLV LVDLIVHPDA LPAHERLRCL YNSTPQEVIP TLRVVYDRFL AGDQNYTSIL PQSSCPIDAN AGMQLATLIA GQNVGTPLVV VVYPNGTYTY VVGYDPASIA KALKG
|
| |