Gene Pars_2154 details

Gene Information

Locus tag: Pars_2154
Symbol:
ID: 5054930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1929358
End bp: 1931310
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 56%
IMG OID: 640469706
Product: type III restriction enzyme, res subunit
Protein accession: YP_001154352
Protein GI: 145592350
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
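
The Start bp, End bp, Gene Length, and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1929358, 1931310)
print(length_bp)                  # 1953
print(protein_length(length_bp))  # 650
```

Both results match the Gene Length and Protein Length fields of this record.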


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.508076
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTCGTT TCGTCGTAGT GGTCAACGGT GTTGAGATTA CAAGAGAGTG GCCGTGGGAG 
CGCATATACT CTGTGAAAGA GGAGTTGAAG AAGATGGGTT TTAGGTGGGA CGGTGCTGGC
TGGAGAGGGA GGACGCAAGA CCTCGCTGTG ATTAACAGAC TGCGGCAACT CTTAGAGCTT
AGCCACGAGG AGTACATGGC TGTTGTGTCT GCAATTGCCC GAGCCTCCTA CGGAGGTGCT
GTGGTCGTAG TTGGCAGACT GCCAGAGGAC TTGAAGCCCC ACGTCTTGGC GTCGGACGGG
GACACGCATT TGGTTTCCCT AACCGGCTTT TTGAGGAGAT TCGTGGCGGA GGACAAGTCC
ATATCTCGCG TCTCGACGCT CGAGGAGTTT GTCAGTCTGG GAGTGGAGAG GTTACGAGAG
TTGCTGAGGG GGGCGGAGGT CTGGGGGGAT TTAGGAAAGG CGCTTGAGGA GGCCAAGGAG
TTTGTTTTGG AGTCTGAAAA GTTGAGGGCT GTCTTTGAGA AGAGGCGTAG CTGGAGGAGG
GCCTTGGTTG GGAGCAACGA GGCGAGGCTG AACTTCCTCG CTTCCGGCCT TTTGAGGAGG
GTGGGCGAGT TTAAGCTGAG GTACAACATC GTGAATAAAG ACGGCGAGCT GGTGGAGCGT
AGCATACGCC TTGTGGAGGT AAAGGAGGGA GACAGCGGCT ACGTCCTACG CTTTCCAGTA
TTTCTCAGGG ATAGAGTGGT TAAAACGCTG GAGGAGCTGG GCTATGTAGT AGAGCACAAA
CAAGTAGAAT ACCCGAAAGT CGCCTTCAAG AAGGATTTCA GCCTCTTTCC CTTCCAGTCC
GAGGCTGTGG ACAATTGGGC GGCGCACGGG ATGAGGGGCA CTGTGATTAT ACCGACAGGC
GGTGGGAAGA CCTTCGTGGG TCTAGAGGCC ATGTACAGAG CTGGGGTCTC CGCATTGGTG
CTCGTGGTGA CAAAAGAGCT CGCCTCTCAG TGGCGGGAGA GGATTAGGAG ATTTCTCGGC
GTATATCCTG GAATGCTAGG GGGCGGGGAG AGGGATGTGC GTAGCGTCAC AGTGGCGATT
TACAATTCAG CGGTTAAGTA CGTGGAGGAT TTGATCGGCA AATTCGGCCT TGTGGTTTTT
GACGAGGCCC ATCATGTGCC GGCAGAGACT TTTAAGGAGG TGGCTCTGAG CCTAGACTCG
CCGTATAGGT TAGCGCTTTC GGCAACGCCG GAGAGGGAGG ATAAAAACGA GCACCTAATA
TACGAGGCGG TAGGCCCCCC CATATACAGG GCCTCCTACC GGTCTATGAT CGAGTCCGGG
CTCGTGGTGC CGGTAGAACA CTATAGAATC TACGTCCGCA TGACGAAAGA GGAGGAGGAG
GCGTACTCTT CTCTCCGTAG CGACAACGCA ATTATGTTGA GGAACGCCGC GGCGAAGGCT
TCGAGGAAGA TCCCCGTGGC GGTCCGCATA ATTGCACACG AAGTAATGTT GGGGTCGAAA
GTACTGGTCT TTACCCAGTT TATAGAACAG GCGGAGGAGC TCCACGACGC GCTTAGGGAG
AGCGGCATTT CAGCAGAGCT TATAACTTCA GAAGAGGGCG GCCGCGACGC CGCGTTTAGA
CGTTTTAGCA ACGGGCTGAG CAGAGTTGTG GTGACGACTA CAGTGCTCGA CGAGGGGGTT
GACGTGCCCG ACGCGGATGT GGCGGTGGTT GTCAGCGGAA CAGGTTCTAG GAGGCAGATG
ATACAAAGAG TTGGGCGCGT CGTGAGAGCC ACACAAGGCA AAAAAGCGGC GAGGGTATAC
GAAATCATAA CCCGCAACAC TATTGAGGAG GCGCTTTCAG AGGCGAGGCA CTTCGACGAC
ATCGTGGAGG AGCTTGTGTG TAAGAGAATC CCCGAGTCCG ACCTAGACGC GCTACTGTCC
CGGGCCCCGC CGCTGTTTAA ATGGATGAAG TAG
 
Protein sequence
MARFVVVVNG VEITREWPWE RIYSVKEELK KMGFRWDGAG WRGRTQDLAV INRLRQLLEL 
SHEEYMAVVS AIARASYGGA VVVVGRLPED LKPHVLASDG DTHLVSLTGF LRRFVAEDKS
ISRVSTLEEF VSLGVERLRE LLRGAEVWGD LGKALEEAKE FVLESEKLRA VFEKRRSWRR
ALVGSNEARL NFLASGLLRR VGEFKLRYNI VNKDGELVER SIRLVEVKEG DSGYVLRFPV
FLRDRVVKTL EELGYVVEHK QVEYPKVAFK KDFSLFPFQS EAVDNWAAHG MRGTVIIPTG
GGKTFVGLEA MYRAGVSALV LVVTKELASQ WRERIRRFLG VYPGMLGGGE RDVRSVTVAI
YNSAVKYVED LIGKFGLVVF DEAHHVPAET FKEVALSLDS PYRLALSATP EREDKNEHLI
YEAVGPPIYR ASYRSMIESG LVVPVEHYRI YVRMTKEEEE AYSSLRSDNA IMLRNAAAKA
SRKIPVAVRI IAHEVMLGSK VLVFTQFIEQ AEELHDALRE SGISAELITS EEGGRDAAFR
RFSNGLSRVV VTTTVLDEGV DVPDADVAVV VSGTGSRRQM IQRVGRVVRA TQGKKAARVY
EIITRNTIEE ALSEARHFDD IVEELVCKRI PESDLDALLS RAPPLFKWMK
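
The gene sequence begins with GTG while the protein begins with M. Translation table 11 shares its amino-acid assignments with the standard code but accepts alternative initiation codons such as GTG, which are read as methionine at the start of a CDS. The following is a minimal sketch of that translation, not the pipeline used to produce this record; `translate` and `START_CODONS` are hypothetical names, and the start-codon set is only a subset of the initiators allowed by table 11.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 initiation codons, enough for this gene.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGGCTCGTT TCGTCGTAGT GGTCAACGGT"))  # MARFVVVVNG
```

The output matches the first ten residues of the protein sequence.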