Gene Pars_1118 details


Gene Information

Locus tag: Pars_1118
Symbol:
ID: 5055201
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1007542
End bp: 1010178
Gene Length: 2637 bp
Protein Length: 878 aa
Translation table: 11
GC content: 65%
IMG OID: 640468674
Product: CRISPR-associated RAMP Crm2 family protein
Protein accession: YP_001153348
Protein GI: 145591346
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID: [TIGR02577] CRISPR-associated protein, Crm2 family
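
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1007542
END_BP = 1010178
GENE_LENGTH_BP = 2637
PROTEIN_LENGTH_AA = 878


def span_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2637
    print(expected_protein_length(GENE_LENGTH_BP))  # 878
```

Both results agree with the Gene Length and Protein Length fields in the record.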


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTTGGT TTGACAAGGC GTGGGCCCTG CTCCACGACC CGCCCTACAA GGCCCTCTGG 
CCACTGGGCT ACAAGCCGCT GGGAGGGAAG ACCCACGAGG AGGAGGCCAA GCGCTTGATG
GCGGCTCTCC TGGGCGGCAC CAAGCTCGGC GGCGGGGCGC CGGACGAGAG GACGTCTAGG
ATAGTCGCCG CCGCTGACAG GCTTGCCTCT TCCTTCGACC GCTGGGCCCT CTCCACGGAG
GGCGAGGCGA AGTACTGGGT CAAGCCGGCG GAGCTGGTCA ACCCCTTCAA CCCGGCGTAC
GCGGCAAGAG TAGAGCCGCC GCCCCCGGAG CAGTTCGGGG GACGGATAGA AAGCTTCGTA
AAAAACGTAA ACCGAGTAGT TAGGGAGGCC GGCGATGAGA AAGAGGCCTA CTTCGCCCTG
TACGCGGTGT ACGAGCTTGC GTGGATCGAG GCCGGCCTGC CGGCCCTGCC AGCAGACACG
AGGGTCCCCA CCCACACCAT ATTCGACCAC CTATACGCCA CCGCATCTGT GATGAACTGG
GTCGGCGACG GCGGCGAGCC GAGGGGGGAC GCCTGCCTTC TTGAAATCGA CATCCCAGGC
ATCCAGAAGG TGATCTCCTC AGCGAGGAAG GCCGGGGACT ACCGCGCCGG CAGCATGTTG
GTCTCCCTGG CTATTTGGGG CACGGCGTGG AGGTACATGG ACAAACACGG CCCCGACGTG
TTGCTCTCCC CATCCCCCCG CTTCAACCCC TTCCTGTACC TCCAGCTGAG GCGCCTCTAC
GGCTGGGGGG AGTCGGCGCT TCGGCTGTAC AGAAAGGTGG CGGGCATGGC ACTCGGGGCG
GACGTCGCGG CGCTGTTGGA CAAAACGCCC CTGGTCCCAG GCACGGCGTA TTTAGCCCTC
CCCAGCTGCT CCGACGCGGA GAGGGCAGTG GAGCACTTCG AGGACGCGCT CGACGAGATA
AGGGCCATGG TCCTGGGGGA GAGGGAGGCG AAGCTCCCCC TAGCGGGCTC AACGACTGGC
GACGTCTTGA AAATAGCCAA GGCGGCCTTG GAGGTTGCGC CAAGGAGGTA CCTCCCAGTG
AGGGTGCGCT ACGCCTCTAT CTCAGAGGCG TGGGGCGCGG CGGAGGAGGC CGCCCGGGAG
GTCAGCAGGG AGGCGGGGTT CGAGGTGGAC CCCGCCAGGT TCATCTTCTG GGCCTTGATG
AAAGTCTTGA AAGAAAAGCC GGCGGTGCCC CACCCAGTGG CCTGGTTCGA CAAGGGCGGC
GCCCCCAGGT TTGTTAAGAG GTACGGAGGG CCGTGGATCT ACAGTAGCCT AGACCCAGAC
CAGCCAGCGG TCCTGAAGCT CTCGGGCGTT GTAACCCCCC AAGGGGTTGA CTACGACGAG
GAGGCCAAGG CCGCCTTGGC CCAGATCGGG GTGCAGAACC TCGCCGAGTT GGCCAAGGTC
TTTAGGCCAA AGGAGGCCCT CGGCCCAGTA GACGTGCTCA AGCGGGCTCT GTACTACGGG
GTGTCCAGGG ATAGGGTGGA GTCGGTGGAG GCGGTGGCCC TTAGGTGGCA CTACCGCCGT
GGGTATTTCA AAAACTGCCC AGACCTGCAG CGGAGGGTGG AGGAGGTCTT GCAAGGCGCC
GACGCCGAGG CGGTCTTCAC GTCGCCGGAA GCCGCCGACA AGACCCTGGC CCAGGCCCCC
CAGCTCTGCG GAGGCGACCC GGCCATCCCC GCCCCCACCC TCATGTACGC CATCATCAGG
GCAGACGGAG ACAACATCGG CAAGCTGATA TCCGGCTGCC TACCCCCCGC ACAGCGGCCC
CCCGTCGAGG ATCGGCTTGT AGAAGGCGAC AAAGGCCAGT GGGAGGAGGA CCAGAGGCGG
CTGGAAAAGC TAGAAAAGGC GGTCCAGGTC CTGGGGGAGA AGGCCAAGTG CCGGGGCGCC
GGCGGCGGGG CTGAGGCGAG GTACGCGGTG CCGTCGCCTG CCTACTACGC CGCCCTCTCC
GCCTCTATGA TGATAACGGC GCTGAAAGAC GCGTACATCG TGGCCAAGCA CGACGGCGAG
GTGGTCTTCG CCGGCGGCGA CGACTTGCTG GCCTTCGCCC CCCTTGCCGC GGCCTTCCAC
ATCGTGAAGG AGAGCAGAGA GGCCTACTGG GGAGAAGGCG GCTTCCACAA AATAGGCCCC
TACTCCCTCC CAGCCCTCGC GGCCTATGGA AAAAGCTACT CGGTAAGAGC CGCCCACTCC
ATTACGGACT TCATGGCCAT AGAGGTGGAG GAGGCGACCC GACTACTGGA AAAGGCCAAG
GAGGCGGTCA GAGGCAAAGA CGCGCTGGCC ATCTCCACCT CCACCGGCCA CGCGGGGTTC
GCCAAGGCGC GCCACGCGGC GCTTGTGGAG CAGATAGCGG AGGCCTACAG GACGGGCAAC
CTCAGCAAAA ACCTCCCCTA CGACCTCGAG AGGTGGGCCG GCGACGGGCT GAGGTGCGGA
GGCGGCGAGA CGTGCAGAGA GGCGGCCAGC ATAATCCTCA CCTACGTGGC CAGCCGCAAC
TCCAAAAACG GCATCCCAAG CCGCTTGGAG GAGCTACTCG ACGCCGTGGC CGACGGGGCG
GATGCGGCAT TGAAAAACGC CGCGGAGCTC CTAAAAGCGG CGAGGGAGTG GGCATGA
 
Protein sequence
MGWFDKAWAL LHDPPYKALW PLGYKPLGGK THEEEAKRLM AALLGGTKLG GGAPDERTSR 
IVAAADRLAS SFDRWALSTE GEAKYWVKPA ELVNPFNPAY AARVEPPPPE QFGGRIESFV
KNVNRVVREA GDEKEAYFAL YAVYELAWIE AGLPALPADT RVPTHTIFDH LYATASVMNW
VGDGGEPRGD ACLLEIDIPG IQKVISSARK AGDYRAGSML VSLAIWGTAW RYMDKHGPDV
LLSPSPRFNP FLYLQLRRLY GWGESALRLY RKVAGMALGA DVAALLDKTP LVPGTAYLAL
PSCSDAERAV EHFEDALDEI RAMVLGEREA KLPLAGSTTG DVLKIAKAAL EVAPRRYLPV
RVRYASISEA WGAAEEAARE VSREAGFEVD PARFIFWALM KVLKEKPAVP HPVAWFDKGG
APRFVKRYGG PWIYSSLDPD QPAVLKLSGV VTPQGVDYDE EAKAALAQIG VQNLAELAKV
FRPKEALGPV DVLKRALYYG VSRDRVESVE AVALRWHYRR GYFKNCPDLQ RRVEEVLQGA
DAEAVFTSPE AADKTLAQAP QLCGGDPAIP APTLMYAIIR ADGDNIGKLI SGCLPPAQRP
PVEDRLVEGD KGQWEEDQRR LEKLEKAVQV LGEKAKCRGA GGGAEARYAV PSPAYYAALS
ASMMITALKD AYIVAKHDGE VVFAGGDDLL AFAPLAAAFH IVKESREAYW GEGGFHKIGP
YSLPALAAYG KSYSVRAAHS ITDFMAIEVE EATRLLEKAK EAVRGKDALA ISTSTGHAGF
AKARHAALVE QIAEAYRTGN LSKNLPYDLE RWAGDGLRCG GGETCREAAS IILTYVASRN
SKNGIPSRLE ELLDAVADGA DAALKNAAEL LKAAREWA
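
As a rough cross-check of the two sequences and the GC content field, the sketch below translates a gene sequence and computes its GC percentage. It is an illustration, not the database's own pipeline. It assumes the amino-acid assignments of translation table 11, which match the standard genetic code (the tables differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses
# the same assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon (not emitted)."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    first_line = ("ATGGGTTGGT TTGACAAGGC GTGGGCCCTG "
                  "CTCCACGACC CGCCCTACAA GGCCCTCTGG")
    print(translate(first_line))  # MGWFDKAWALLHDPPYKALW
```

The first 60 bases of the gene sequence translate to the first 20 residues of the listed protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.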