Gene Pars_1432 details


Gene Information

Locus tag: Pars_1432
Symbol:
ID: 5054831
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1290555
End bp: 1291910
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 55%
IMG OID: 640468973
Product: type III restriction enzyme, res subunit
Protein accession: YP_001153642
Protein GI: 145591640
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
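
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS of the given length, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1290555, 1291910)
print(length)                  # 1356
print(protein_length(length))  # 451
```

Both values agree with the Gene Length and Protein Length fields reported for Pars_1432.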


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.503563
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0109836
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGACTAG TTATTACCTG GGACAGGGGG ACTATACTAC TAGAGGGCGA GGTCCCCAAC 
GAGATTAAGA CGCTATCTTT TATCAAATTC GACGGGAGGG TCGGAAAGTA CAGAGCCCTG
GCGATATACT ACCCGCGGCT ATTGGCCGTG GCGAAGTCGC TTGGCCAAGA GGTGGAGGAC
AGGGTTTGGG GCCTCCAGTG CGGCGAAGTG AGGCCGGCCT CTGAGGTGAA GCTTAGGGCC
TACCAAGAAG AGGCGCTGAG GGCGTGGATG AGGACTAAGA GGGGCGTCGT AGTGATGCCC
ACTGGCTCGG GCAAAACCCA CGTGGCAATA GCCGCAATAG CCCAGCTTAA AGAGCCGGCG
CTTGTGGTAG TGCCTACGGT AGAGCTAGTG CAACAGTGGC ACGCCAAGCT TAGGCACTAC
TTCCCCGGAA GGGTGGGGGT GTGGTATGGT GAGGAGAAGA GGGAGAGTTG CATTACCGTA
ATCACCTACG ACTCGGCATA CACAGCCGTT GAGGCTATCG GCAATAGGTA CAAGTTGCTG
GTATTCGACG AGGTGCACCA CCTACCTTCC CAATCCTACC GGCAAATAGC TGAGCTAAGC
CCAGCGCCGC ACCGACTCGG CCTAACCGCC ACGCCGGAGA GGGCAGATGG GCTCCACGTA
GACCTAGACT GGCTCGTTGG CCCAGTAGTT TACCGGATTA CCGCCTCTGA AATAAGAGGA
GTCTGGACGG CCGACTACGA GATTGAGATT ATAAAAGTAA GGCTTAGGGA AAACGAGGCG
AAGTTATACA AAGAGCTCGA AGCCAAATAC CTCGCTTATT TGAGAAAGAA AGGCCTCAAG
TTCAGATCCC CCTCTGATTT CCAAAAACTC GTAATACTAT CAGGCCGCGA TCCCCGCGCC
AAGGAGGCGC TGGACGCTTG GCATGAGATG AGGCGCCTCG TACTGGAGAC AGAGGCGAAG
GTAGACGCCG TCGGGGAAAT ACTGAGTAGG CATAGAGGAT CAAAAATACT CATATTTACC
GAATACACAT CGCTGGCGAG GTCGGTCTCG GAGAGGTATT TGATCCCGCT GATTACCCAC
GACATGTCCC CCTACGAGAG GGAGCAGATT ATGGCCATGT TCAGAAGAGG CGAGGTAAAA
GCCATCGTCA CAGGCAAAGT ATTAGACGAG GGGGTAGATG TGCCCGACGT CGACGTCGTG
GTAATACTTG GAGGCACTTC CAGTGCTAGG CAATTCATCC AGCGGATGGG TAGGGCGCTT
AGGCTTAAGC CCCACAAGGC CAAGATATAC GAAGTGGTCA CCGCCAGCAC TAGGGAGGTC
CACACAGCAC GTAGGCGGAA AAAGGGGGTT TCGTGA
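
The GC content field in the gene record can be recomputed from a sequence dump in this layout. The sketch below is a hedged illustration in Python: it assumes the sequence is pasted as text in blocks of ten bases separated by spaces and line breaks, and `gc_content` is a hypothetical helper name. Only the first row of the sequence is used here, so the printed figure describes that row alone; passing the full sequence text is what would be compared against the reported 55%.

```python
def gc_content(sequence_text):
    """Fraction of G and C bases in a sequence, ignoring spaces and line breaks."""
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, copied as formatted in the record.
first_row = "GTGGGACTAG TTATTACCTG GGACAGGGGG ACTATACTAC TAGAGGGCGA GGTCCCCAAC"
print(f"{gc_content(first_row):.0%}")
```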
 
Protein sequence
MGLVITWDRG TILLEGEVPN EIKTLSFIKF DGRVGKYRAL AIYYPRLLAV AKSLGQEVED 
RVWGLQCGEV RPASEVKLRA YQEEALRAWM RTKRGVVVMP TGSGKTHVAI AAIAQLKEPA
LVVVPTVELV QQWHAKLRHY FPGRVGVWYG EEKRESCITV ITYDSAYTAV EAIGNRYKLL
VFDEVHHLPS QSYRQIAELS PAPHRLGLTA TPERADGLHV DLDWLVGPVV YRITASEIRG
VWTADYEIEI IKVRLRENEA KLYKELEAKY LAYLRKKGLK FRSPSDFQKL VILSGRDPRA
KEALDAWHEM RRLVLETEAK VDAVGEILSR HRGSKILIFT EYTSLARSVS ERYLIPLITH
DMSPYEREQI MAMFRRGEVK AIVTGKVLDE GVDVPDVDVV VILGGTSSAR QFIQRMGRAL
RLKPHKAKIY EVVTASTREV HTARRRKKGV S
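
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG is one of the permitted start codons, and a start codon is translated as methionine. The Python sketch below illustrates this under stated assumptions: the codon table is built from the standard amino acid assignments that table 11 uses, `START_CODONS` lists only a subset of the table 11 start codons, and `translate_cds` is a hypothetical helper rather than a function from an existing library.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG codon order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}

# Assumption: a subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence_text):
    """Translate a CDS read in frame from its first base, stopping at a stop codon."""
    dna = "".join(sequence_text.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGGACTAG TTATTACCTG GGACAGGGGG"))  # MGLVITWDRG
```

The output matches the first ten residues of the protein sequence listed in the record.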