Gene Information |
Locus tag | Pars_0841 |
Symbol | |
ID | 5054868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 746780 |
End bp | 747964 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640468402 |
Product | hypothetical protein |
Protein accession | YP_001153079 |
Protein GI | 145591077 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1030] Membrane-bound serine protease (ClpP class) |
TIGRFAM ID | |
|
|
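The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length. Both are common conventions but are not stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 746780
end_bp = 747964

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1185 0 394
```

The result matches the reported Gene Length (1185 bp) and Protein Length (394 aa).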
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.391136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGGGT GGAGGGCTGT TGTACTTGCA CTGCTCGCCT CCGCCTTGGC GCTGGCCGCA TCCAACGCCG TGGCCTTGTA CCTCGATGGC ACCATTGACG GAACCGCAGT TACCCTAGCC CGAGCGGCGC TGGCGGATGC CCAAAGCCGC GGGTTACCTC TCGTGGTGGT TATCAATACC TACGGGGGTT TCCTTGCCCC AATGGATCAA ATTGTGGAGC TGTTCTTAAA CGCGGGGGTT CCGGTATATG CCTACGTGCC GGAGGGGGGC AAGGCCGTTT CGGCGGGGGC CTTCGTGGCC ATGGCGGCTA GGAGGATCTA CATGGCGCCC ACCGCCGAGA TAGGCGCCGC CGAGCCTAGG CCGCCTGACC CCAAGGTGGT GAACTACGCC GCGGCCCGGA TGAGGGCGCT GGCGTCTGCC AAGTGGAACG ACTCCAGGGT GGACATAGCC GAGTCTTTTG TCAGGGAGAA CAAGGTGCTC ACAGGAGCGG AGGCCGTTAA GCTGGGAATC GCTGAGCCAC TGCCGTCGGG AGGCTGGGTT TTTGTCGCCG AGTACCGGAG GGACCCCCTC TCCAGCCTCC TAAACGCCTT GTCTGATCCC GCGGTGATAT CGCTACTGCT CCTGCTGGGC GTCGTGTTCA TTGGCTACGA GCTCCTAGCC GGCGGCTTCC AAGGCGTCGG CGTAGTCGGG GGGCTTTTAC TAGTGCTCGC CCTCTACCTC TTGGGCCAGC TGGGCTCTGA GTGGCTCTGG GCGGCGCTGG CCATCGGCGG GGCTACGCTC ATCGCCGCGG AGATCTTCGC CGGCCACGGC GCCTTCGCCG CCACTGGTCT GGCCCTCTTC GGCCTCTCCC TATACTTCGC AAGCGTCAGC CAGCCCTACT ACCAGCTCCA AGGCGCCTCC TACGCCCTGT CCTCCTTGGC CGCGCTGGGC GCCTTGGCCG TGGCCTACCT GGGCTATAAG GTGAGGCAGG CGATGCGGAG AAAGCCGCTT AACTACAAGG CACAGCTGGT GGGGGCCTTG GGGGTCGCCA AGACTGAGAT AAGGCCGGGT CAGCCGGGGG TGGCGTACGT GGCGGAGGAG GAGTGGACAG CTGTCTCAGA CGAGGAGATA AAGCCAGGGG AGAGGGTGGT GGTGGAGGGC GTTGAGGGCC TCACCTTAAT GGTTAAAAAG GCCAAGTCTG CATAG
|
Protein sequence | MAGWRAVVLA LLASALALAA SNAVALYLDG TIDGTAVTLA RAALADAQSR GLPLVVVINT YGGFLAPMDQ IVELFLNAGV PVYAYVPEGG KAVSAGAFVA MAARRIYMAP TAEIGAAEPR PPDPKVVNYA AARMRALASA KWNDSRVDIA ESFVRENKVL TGAEAVKLGI AEPLPSGGWV FVAEYRRDPL SSLLNALSDP AVISLLLLLG VVFIGYELLA GGFQGVGVVG GLLLVLALYL LGQLGSEWLW AALAIGGATL IAAEIFAGHG AFAATGLALF GLSLYFASVS QPYYQLQGAS YALSSLAALG ALAVAYLGYK VRQAMRRKPL NYKAQLVGAL GVAKTEIRPG QPGVAYVAEE EWTAVSDEEI KPGERVVVEG VEGLTLMVKK AKSA
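The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, where GTG is one of several alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation in plain Python, not the pipeline used to produce this record. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical, and the start codon set is the one NCBI lists for table 11.

```python
# Standard codon assignments, enumerated in T, C, A, G order.
# Table 11 shares these assignments and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine regardless of its
            # usual meaning (GTG would otherwise encode valine).
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above, spaces included as in the record.
print(translate_cds("GTGGCTGGGT GGAGGGCTGT TGTACTTGCA"))  # MAGWRAVVLA
```

Passing the full Gene sequence field to `translate_cds` should reproduce the Protein sequence field once the spaces that group both sequences into blocks of ten are removed.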