Gene Pars_1787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1787 
Symbol 
ID5055591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1607300 
End bp1608238 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content50% 
IMG OID640469332 
Productputative DNA primase, small subunit 
Protein accessionYP_001153990 
Protein GI145591988 
COG category[L] Replication, recombination and repair 
COG ID[COG1467] Eukaryotic-type DNA primase, catalytic (small) subunit 
TIGRFAM ID[TIGR00335] DNA primase, eukaryotic-type, small subunit, putative 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.057405 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATAACAG AGGTATTTTT CCGCAACTTT TACAGAAACT ACGCAAAGTT CGACGTAGTA 
TCTGTGGAAA GGAGAGAATT CGCCTTTCAA CCCTTCGGCG GAGGGATGGT TAGGCACAAA
TCTTTCAATT CTGTGGATGA GCTTAGGAGG TACATCGTGG AGAAAACCCC GAAGCATATC
TACCACTCAG TGGCGTACTA CGAAAGGCCC GGCGAGGAGG ATATGGACCG GAAGGGATGG
CTCGGCGCCG ATCTCGTATT TGACATCGAC GGCGACCACC TCAACACCGA GGCTTGTAAA
GGCAGTGCGG TGGTGTCCTT ACGTTGCCTC GAAGACGCCA AGGAAGAGAC CAACAAGCTG
ATAGACATCC TTGTGCGCGA GCTCGACCTC AGACCAACCC GAATAGTATT TTCTGGGAAC
AGGGGCTTCC ACATTCACAT CACAAGCGAG GAGGTTCTAA AGCTGGGGAC CAAGGAGAGA
AGAGAAGTCG TTAATTTCAT AAAGGGCGTC GGCTTCGATC CCAGTAGGTT TGAGGTGAAG
CTAGGTAGAA GGAGAGTGAA GCTCTACGAG GAAGAGCCGG TGGGTAGCCT CTTGAGAGTG
AGACAAGCGG TGGAGAACCC CGACACGCTG AGAGTCGAAA TAGACGAAGT AGTGACTCAG
GACATCCACC GCCTCATAAG ATTGCCCGGC TCTCTCAACG GGAAGACAGG ACTCGTGGCC
ATGCCTCTGG AACTGAAAGA CCTAGAAAGA GGCGTTGAGA ACATCGTCGA ACGCGCCATT
GCGTTTAGGA AAGGCAATTT AAAATTCAGA TTTGAAAAGC CGCTTATTGG TGAGGTGCTC
TTCGAAAAAA TAGAGGCCCG TGCGGGGGAT CTGAAAATTT TGCCAGCCCA CGTGGCAATA
TATTTAGAAC TCCAAGAGTT TGGGAAAATA TATGATTGA
 
Protein sequence
MITEVFFRNF YRNYAKFDVV SVERREFAFQ PFGGGMVRHK SFNSVDELRR YIVEKTPKHI 
YHSVAYYERP GEEDMDRKGW LGADLVFDID GDHLNTEACK GSAVVSLRCL EDAKEETNKL
IDILVRELDL RPTRIVFSGN RGFHIHITSE EVLKLGTKER REVVNFIKGV GFDPSRFEVK
LGRRRVKLYE EEPVGSLLRV RQAVENPDTL RVEIDEVVTQ DIHRLIRLPG SLNGKTGLVA
MPLELKDLER GVENIVERAI AFRKGNLKFR FEKPLIGEVL FEKIEARAGD LKILPAHVAI
YLELQEFGKI YD