Gene Pars_0112 details


Gene Information

Locus tag: Pars_0112
Symbol:
ID: 5054945
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 97498
End bp: 99630
Gene Length: 2133 bp
Protein Length: 710 aa
Translation table: 11
GC content: 63%
IMG OID: 640467691
Product: DEAD/DEAH box helicase domain-containing protein
Protein accession: YP_001152379
Protein GI: 145590377
COG category: [R] General function prediction only
COG ID: [COG1204] Superfamily II helicase
TIGRFAM ID: [TIGR01954] transcription termination factor NusA, C-terminal duplication
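
The length fields in this record follow from its coordinates. Assuming Start bp and End bp are 1-based and inclusive (an assumption, though it is consistent with the listed values), the gene length is End minus Start plus 1, and the protein length is the number of codons minus the terminal stop codon. A minimal sketch in Python; the function names are illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(97498, 99630)
print(length, protein_length(length))  # prints: 2133 710
```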


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.757622
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTGG ATGTCTCTGA GCTTCCTCTT GACGGGAGGC TTATCTCTGT CCTCAAGGAG 
AGGGGCGTGA GGGAGCTGTT CCCGCCGCAG GTGGAGGCGG TTAAGGCCGG GATCTTCGAC
GGGGCTAACG TCCTCTTGTG CACAGCGACT GCTTCTGGGA AGTCCCTCCT AGCCGAGGTG
GCCTCAGTAA AGGCGGCCCT GGAGGGGAGG ATGGCCTTGT ACGCGGTGCC GCTTAAGGCT
TTGGCCCAGG AGAAGTTGCT CCACTTCTCC CACTACAGGA GCCTAGCAAA GGTCGGCATA
TCTACGGGGG ATTTTGAGTC CGACGACAGG AGGCTCTACG AGTACGACGT GGTGGTGGTG
ACTTACGAAA AGCTGGACAG TCTCCTCCGC CACCGGCCGA GCTGGCTGAG CTCCGTCGGC
GTGGTGGTTG TCGACGAGAT ACACTACCTG GGCGACCCCA AGAGGGGGCC CGTCCTGGAG
TCTATAATTG CCAAGATAAG GCACCTGGGC CTAAGGGCTC AGTTCATCGG CCTAAGCGCC
ACGGTGGGGA ACGCGGCGGA AGTTGCGGAG TGGCTTGGCG CTCGGCTGGT AAAGTCGAGC
TGGCGGCCTG TGCCGCTTAG GGAGGGGGTC TACTACGGCG GCAGGATATA CTTCCCCGAC
GGGGCCCACA AGGCTGTCGG CGCCTCCGGC GAAGCCGAAG TGGCCCTTGC CCTGGACGCG
GTGGCCGGCG GGGGGCAGGC GCTGGTCTTC ACGAACAGCA GGTCGTCAAC TGCGCGTATT
GCAAAGGCGG TGGCTAAGGC CGTGGCGGCC TACCCGGCGC AGCTCATAAA TCCTGGCGAG
GCGCGGGCCT TGGCTGAGGA GGTGCTTAGG GTCTCTTCCA GCAAGATCAT CGGTAGGGAG
CTGGCTGAGC TGGTGGCACG CGGCGTGGCC TTCCACAACG CGGGTCTCGA GCTGGAGGTG
AGGAGGCTCG TGGAGGAGGG GTTTAGGAGG GGGGTTGTAA AGGTCGTGGT GTCGACGACG
ACTCTGGCGG CTGGGGTCAA CCTGCCGGCT AGGCGCGTCG TGGTGGCGGA ATACGAGAGG
TACGACCCAG TGGTGGGGAG GGAGGAGATA CCTGTGTTGG AGTATAGGCA GATGGCGGGG
CGTGCGGGGC GGCCGGGGCT GGATCCCTAT GGCGAGGCCG TTATTGTCGC CAGGAGCAGG
GGGGATGTGG AGTACCTCAT GGAGAGGTAT GTCATGGGGC AGGTGGAGAA TGTGAGGTCG
CACATCCTCT CTGCCCCCAA TCTCAGATCC CACGCGCTCG GTGCCGTGGG CGGCGGCTAC
GCCAAGTCGG TAGACGACCT CGTGGACTTC TTCTCAAACA CCTTGGGCTT TCACCAGGCC
AAGACGCCTC TTAAGTCCTC CCTCCTCCGG TCTAAGGTGG CTGAGGCCCT GGACGAGTTG
GTGGAGTGGG GATTCCTGGA GCGGGATGGC GACGTGGTGT ACGCCACTGA GCTCGGCAGG
CAAGTGGCGA GGCTGTACCT GGATCCCGAA GTCGCCGCCC GCTACCTCTC CATGCTGAGG
TCTATGCGGA GGGGGTCTAT CTACGCGTAC CTCTACGTCG TATTGACCGC TCCTGATTTC
CCCAGGGTTA GGCGGGGGAG GTTGGCGGCG GATGTAGCCC GCGAGGTTTT GGCGGCTCTG
CCCGACGTCG AGGAGGACGA GGAGTTTGAA GACGTGGCGA GAACAGCGGC GATGTTGATG
GCGTGGATAG AGGAGGAGGA TGAGGACAAG ATATACGAGC GCTTCGAGGT GGCGCCTGGC
GACTTGAGAG TGTACGTGGA CCTCTTCGAG TGGCTTGGAA ACGCCGCGGC TAAGCTGGCC
GGCATGGTGG GGCTTGAGGA GCACAGAAGA AGTCTAGAGG TCCTCACCGC GAGGGTGGTG
CACGGCGTGA GGGAGGAGCT CATCCCGCTG GTCACTGCTC TTAGAGGCGT GGGGAGGGTC
CGCGCCAGGG TGTTGTACAA CTTCGGCTTC CGGACTTTGA GAGACATCGC CAGGGCCTCT
GTGAGGGAGA TCGCCTCTCT CCCCGGCTTC GGCGAAAAAC TCGCCGAGTC TATCATAGAA
CAGGCAAGAC AGTTAGTAGA GGGCAACGCC TAA
 
Protein sequence
MAVDVSELPL DGRLISVLKE RGVRELFPPQ VEAVKAGIFD GANVLLCTAT ASGKSLLAEV 
ASVKAALEGR MALYAVPLKA LAQEKLLHFS HYRSLAKVGI STGDFESDDR RLYEYDVVVV
TYEKLDSLLR HRPSWLSSVG VVVVDEIHYL GDPKRGPVLE SIIAKIRHLG LRAQFIGLSA
TVGNAAEVAE WLGARLVKSS WRPVPLREGV YYGGRIYFPD GAHKAVGASG EAEVALALDA
VAGGGQALVF TNSRSSTARI AKAVAKAVAA YPAQLINPGE ARALAEEVLR VSSSKIIGRE
LAELVARGVA FHNAGLELEV RRLVEEGFRR GVVKVVVSTT TLAAGVNLPA RRVVVAEYER
YDPVVGREEI PVLEYRQMAG RAGRPGLDPY GEAVIVARSR GDVEYLMERY VMGQVENVRS
HILSAPNLRS HALGAVGGGY AKSVDDLVDF FSNTLGFHQA KTPLKSSLLR SKVAEALDEL
VEWGFLERDG DVVYATELGR QVARLYLDPE VAARYLSMLR SMRRGSIYAY LYVVLTAPDF
PRVRRGRLAA DVAREVLAAL PDVEEDEEFE DVARTAAMLM AWIEEEDEDK IYERFEVAPG
DLRVYVDLFE WLGNAAAKLA GMVGLEEHRR SLEVLTARVV HGVREELIPL VTALRGVGRV
RARVLYNFGF RTLRDIARAS VREIASLPGF GEKLAESIIE QARQLVEGNA
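
The gene and protein sequences can be cross-checked against each other and against the GC content field. The sketch below assumes the layout used in this record (blocks of ten characters separated by spaces) and uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11; '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(block_text: str) -> str:
    """Remove the spaces and line breaks that separate the ten-character blocks."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = clean("ATGGCCGTGG ATGTCTCTGA GCTTCCTCTT GACGGGAGGC TTATCTCTGT CCTCAAGGAG")
print(translate(first_line))  # prints: MAVDVSELPLDGRLISVLKE
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence with its block spacing removed, and `gc_fraction` applied to the same string gives the value to compare with the GC content field.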