Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1339 |
Symbol | |
ID | 5054156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1205088 |
End bp | 1206095 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640468885 |
Product | hypothetical protein |
Protein accession | YP_001153554 |
Protein GI | 145591552 |
COG category | [S] Function unknown |
COG ID | [COG1817] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00496321 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.353078 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTATTGGC GCTTCATGGT TAGGTTTCTC TCCGACGCGT TGACGCCGAA GCAGGCACGT ATCGCCGCGT TGCTCAAGCT TGAGGGGGCC AAGCGCGGCG TTGAGGTGGA GATAACGTGC CGCCACTACA TGCATGTTTC AGACATACTC GACATGTACG GCGTCTCTTA TAGATGTTTT GGACAATACG GCCTCACTGT ATACGAAAAG CTTGTGTACG GCATCGAGAG ACAGAGGGAG TTGGCCGAGG TGGCGAGGCA GGTAGACGGA ATGCTGGGCT TCCCATCCCC AGACGCGGCG AGGGTGGTGT TTGGGCTGGG AAAGCCCGTG TTGGTGCTCA ACGACACCCC CCACGCAACT CACGTAAATA GGCTAGTCAT ACCGCTTTCG GAAGCTCTCG TAGCACCCGC GGCCATCCCC GAGGAGATGT GGCGCCCCTA CTGCCCCAGG AAAGTTGTCA CTTTCGACGG GGTATTCGAG TATATGTGGA CGTCGAGGTT TAAACCTGAT GAGTCTGTGG TGAAGAGCCT CGGCTTGGAG CCAGGCGGAT ACGTGGTTTT TAGGCCGGAG GAGAGGTATG CGGCGTATTA CAAGTGGGAA TACACAGAGC TTCGCATAAA GCTGGCTAGG GCTGTGGAGG GCCTTGGTTA CAATGTAGTT AACGTGCCGC GCTATCCGGA CCAGGTGCTG GAGGGGGCCA TCAACTTGAC TAGGGCTGTG GATCACTTGC AACTGGCATA CTTCTCGGCG GGGGTTATAA CTGGGGGCGC CTCGATGGCC ACAGAAGCTG CGCTTCTAGG CGTGCCTGCG TTGTCCTATT TCCCCCAGAG CTACTACGTA GATCGTTATC TTGCAGAGAA GGGAGCCCCG CTTTACCGGT GCGACAGCTT AGAGACTTGC CTCTCGAGTC TCAGAGAGAT GTTGCGCCGC GGCAGGTCTG CGCCAGTAAG GCTTGAAGAC CCCGCCGGGA TTATTTTCGA TGCGGCACTA AGCGCTGTTT CAAGATAA
|
Protein sequence | MYWRFMVRFL SDALTPKQAR IAALLKLEGA KRGVEVEITC RHYMHVSDIL DMYGVSYRCF GQYGLTVYEK LVYGIERQRE LAEVARQVDG MLGFPSPDAA RVVFGLGKPV LVLNDTPHAT HVNRLVIPLS EALVAPAAIP EEMWRPYCPR KVVTFDGVFE YMWTSRFKPD ESVVKSLGLE PGGYVVFRPE ERYAAYYKWE YTELRIKLAR AVEGLGYNVV NVPRYPDQVL EGAINLTRAV DHLQLAYFSA GVITGGASMA TEAALLGVPA LSYFPQSYYV DRYLAEKGAP LYRCDSLETC LSSLREMLRR GRSAPVRLED PAGIIFDAAL SAVSR
|
| |