Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1432 |
Symbol | |
ID | 5054831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1290555 |
End bp | 1291910 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640468973 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001153642 |
Protein GI | 145591640 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.503563 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0109836 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGACTAG TTATTACCTG GGACAGGGGG ACTATACTAC TAGAGGGCGA GGTCCCCAAC GAGATTAAGA CGCTATCTTT TATCAAATTC GACGGGAGGG TCGGAAAGTA CAGAGCCCTG GCGATATACT ACCCGCGGCT ATTGGCCGTG GCGAAGTCGC TTGGCCAAGA GGTGGAGGAC AGGGTTTGGG GCCTCCAGTG CGGCGAAGTG AGGCCGGCCT CTGAGGTGAA GCTTAGGGCC TACCAAGAAG AGGCGCTGAG GGCGTGGATG AGGACTAAGA GGGGCGTCGT AGTGATGCCC ACTGGCTCGG GCAAAACCCA CGTGGCAATA GCCGCAATAG CCCAGCTTAA AGAGCCGGCG CTTGTGGTAG TGCCTACGGT AGAGCTAGTG CAACAGTGGC ACGCCAAGCT TAGGCACTAC TTCCCCGGAA GGGTGGGGGT GTGGTATGGT GAGGAGAAGA GGGAGAGTTG CATTACCGTA ATCACCTACG ACTCGGCATA CACAGCCGTT GAGGCTATCG GCAATAGGTA CAAGTTGCTG GTATTCGACG AGGTGCACCA CCTACCTTCC CAATCCTACC GGCAAATAGC TGAGCTAAGC CCAGCGCCGC ACCGACTCGG CCTAACCGCC ACGCCGGAGA GGGCAGATGG GCTCCACGTA GACCTAGACT GGCTCGTTGG CCCAGTAGTT TACCGGATTA CCGCCTCTGA AATAAGAGGA GTCTGGACGG CCGACTACGA GATTGAGATT ATAAAAGTAA GGCTTAGGGA AAACGAGGCG AAGTTATACA AAGAGCTCGA AGCCAAATAC CTCGCTTATT TGAGAAAGAA AGGCCTCAAG TTCAGATCCC CCTCTGATTT CCAAAAACTC GTAATACTAT CAGGCCGCGA TCCCCGCGCC AAGGAGGCGC TGGACGCTTG GCATGAGATG AGGCGCCTCG TACTGGAGAC AGAGGCGAAG GTAGACGCCG TCGGGGAAAT ACTGAGTAGG CATAGAGGAT CAAAAATACT CATATTTACC GAATACACAT CGCTGGCGAG GTCGGTCTCG GAGAGGTATT TGATCCCGCT GATTACCCAC GACATGTCCC CCTACGAGAG GGAGCAGATT ATGGCCATGT TCAGAAGAGG CGAGGTAAAA GCCATCGTCA CAGGCAAAGT ATTAGACGAG GGGGTAGATG TGCCCGACGT CGACGTCGTG GTAATACTTG GAGGCACTTC CAGTGCTAGG CAATTCATCC AGCGGATGGG TAGGGCGCTT AGGCTTAAGC CCCACAAGGC CAAGATATAC GAAGTGGTCA CCGCCAGCAC TAGGGAGGTC CACACAGCAC GTAGGCGGAA AAAGGGGGTT TCGTGA
|
Protein sequence | MGLVITWDRG TILLEGEVPN EIKTLSFIKF DGRVGKYRAL AIYYPRLLAV AKSLGQEVED RVWGLQCGEV RPASEVKLRA YQEEALRAWM RTKRGVVVMP TGSGKTHVAI AAIAQLKEPA LVVVPTVELV QQWHAKLRHY FPGRVGVWYG EEKRESCITV ITYDSAYTAV EAIGNRYKLL VFDEVHHLPS QSYRQIAELS PAPHRLGLTA TPERADGLHV DLDWLVGPVV YRITASEIRG VWTADYEIEI IKVRLRENEA KLYKELEAKY LAYLRKKGLK FRSPSDFQKL VILSGRDPRA KEALDAWHEM RRLVLETEAK VDAVGEILSR HRGSKILIFT EYTSLARSVS ERYLIPLITH DMSPYEREQI MAMFRRGEVK AIVTGKVLDE GVDVPDVDVV VILGGTSSAR QFIQRMGRAL RLKPHKAKIY EVVTASTREV HTARRRKKGV S
|
| |