Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2154 |
Symbol | |
ID | 5054930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1929358 |
End bp | 1931310 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640469706 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001154352 |
Protein GI | 145592350 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.508076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTCGTT TCGTCGTAGT GGTCAACGGT GTTGAGATTA CAAGAGAGTG GCCGTGGGAG CGCATATACT CTGTGAAAGA GGAGTTGAAG AAGATGGGTT TTAGGTGGGA CGGTGCTGGC TGGAGAGGGA GGACGCAAGA CCTCGCTGTG ATTAACAGAC TGCGGCAACT CTTAGAGCTT AGCCACGAGG AGTACATGGC TGTTGTGTCT GCAATTGCCC GAGCCTCCTA CGGAGGTGCT GTGGTCGTAG TTGGCAGACT GCCAGAGGAC TTGAAGCCCC ACGTCTTGGC GTCGGACGGG GACACGCATT TGGTTTCCCT AACCGGCTTT TTGAGGAGAT TCGTGGCGGA GGACAAGTCC ATATCTCGCG TCTCGACGCT CGAGGAGTTT GTCAGTCTGG GAGTGGAGAG GTTACGAGAG TTGCTGAGGG GGGCGGAGGT CTGGGGGGAT TTAGGAAAGG CGCTTGAGGA GGCCAAGGAG TTTGTTTTGG AGTCTGAAAA GTTGAGGGCT GTCTTTGAGA AGAGGCGTAG CTGGAGGAGG GCCTTGGTTG GGAGCAACGA GGCGAGGCTG AACTTCCTCG CTTCCGGCCT TTTGAGGAGG GTGGGCGAGT TTAAGCTGAG GTACAACATC GTGAATAAAG ACGGCGAGCT GGTGGAGCGT AGCATACGCC TTGTGGAGGT AAAGGAGGGA GACAGCGGCT ACGTCCTACG CTTTCCAGTA TTTCTCAGGG ATAGAGTGGT TAAAACGCTG GAGGAGCTGG GCTATGTAGT AGAGCACAAA CAAGTAGAAT ACCCGAAAGT CGCCTTCAAG AAGGATTTCA GCCTCTTTCC CTTCCAGTCC GAGGCTGTGG ACAATTGGGC GGCGCACGGG ATGAGGGGCA CTGTGATTAT ACCGACAGGC GGTGGGAAGA CCTTCGTGGG TCTAGAGGCC ATGTACAGAG CTGGGGTCTC CGCATTGGTG CTCGTGGTGA CAAAAGAGCT CGCCTCTCAG TGGCGGGAGA GGATTAGGAG ATTTCTCGGC GTATATCCTG GAATGCTAGG GGGCGGGGAG AGGGATGTGC GTAGCGTCAC AGTGGCGATT TACAATTCAG CGGTTAAGTA CGTGGAGGAT TTGATCGGCA AATTCGGCCT TGTGGTTTTT GACGAGGCCC ATCATGTGCC GGCAGAGACT TTTAAGGAGG TGGCTCTGAG CCTAGACTCG CCGTATAGGT TAGCGCTTTC GGCAACGCCG GAGAGGGAGG ATAAAAACGA GCACCTAATA TACGAGGCGG TAGGCCCCCC CATATACAGG GCCTCCTACC GGTCTATGAT CGAGTCCGGG CTCGTGGTGC CGGTAGAACA CTATAGAATC TACGTCCGCA TGACGAAAGA GGAGGAGGAG GCGTACTCTT CTCTCCGTAG CGACAACGCA ATTATGTTGA GGAACGCCGC GGCGAAGGCT TCGAGGAAGA TCCCCGTGGC GGTCCGCATA ATTGCACACG AAGTAATGTT GGGGTCGAAA GTACTGGTCT TTACCCAGTT TATAGAACAG GCGGAGGAGC TCCACGACGC GCTTAGGGAG AGCGGCATTT CAGCAGAGCT TATAACTTCA GAAGAGGGCG GCCGCGACGC CGCGTTTAGA CGTTTTAGCA ACGGGCTGAG CAGAGTTGTG GTGACGACTA CAGTGCTCGA CGAGGGGGTT GACGTGCCCG ACGCGGATGT GGCGGTGGTT GTCAGCGGAA CAGGTTCTAG GAGGCAGATG ATACAAAGAG TTGGGCGCGT CGTGAGAGCC ACACAAGGCA AAAAAGCGGC GAGGGTATAC GAAATCATAA CCCGCAACAC TATTGAGGAG GCGCTTTCAG AGGCGAGGCA CTTCGACGAC ATCGTGGAGG AGCTTGTGTG TAAGAGAATC CCCGAGTCCG ACCTAGACGC GCTACTGTCC CGGGCCCCGC CGCTGTTTAA ATGGATGAAG TAG
|
Protein sequence | MARFVVVVNG VEITREWPWE RIYSVKEELK KMGFRWDGAG WRGRTQDLAV INRLRQLLEL SHEEYMAVVS AIARASYGGA VVVVGRLPED LKPHVLASDG DTHLVSLTGF LRRFVAEDKS ISRVSTLEEF VSLGVERLRE LLRGAEVWGD LGKALEEAKE FVLESEKLRA VFEKRRSWRR ALVGSNEARL NFLASGLLRR VGEFKLRYNI VNKDGELVER SIRLVEVKEG DSGYVLRFPV FLRDRVVKTL EELGYVVEHK QVEYPKVAFK KDFSLFPFQS EAVDNWAAHG MRGTVIIPTG GGKTFVGLEA MYRAGVSALV LVVTKELASQ WRERIRRFLG VYPGMLGGGE RDVRSVTVAI YNSAVKYVED LIGKFGLVVF DEAHHVPAET FKEVALSLDS PYRLALSATP EREDKNEHLI YEAVGPPIYR ASYRSMIESG LVVPVEHYRI YVRMTKEEEE AYSSLRSDNA IMLRNAAAKA SRKIPVAVRI IAHEVMLGSK VLVFTQFIEQ AEELHDALRE SGISAELITS EEGGRDAAFR RFSNGLSRVV VTTTVLDEGV DVPDADVAVV VSGTGSRRQM IQRVGRVVRA TQGKKAARVY EIITRNTIEE ALSEARHFDD IVEELVCKRI PESDLDALLS RAPPLFKWMK
|
| |