Gene Pars_1917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1917 
Symbol 
ID5055252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1722707 
End bp1723756 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content55% 
IMG OID640469463 
ProductDNA primase large subunit 
Protein accessionYP_001154116 
Protein GI145592114 
COG category[L] Replication, recombination and repair 
COG ID[COG2219] Eukaryotic-type DNA primase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACATGTG ATATTTCGCT TAGAGAATTC GCCTGCTACT TCCCCTTCCT CAATAAGTCC 
GCCTCATACC TACAGAAAAG GGGCTACCTA CTTGACGTGG CGCTTAGCGA TAAAAAACTC
TTGGAAAAGG CAGTTGAGAG GCTGAAGAGG GCCCTTGCCC ACGAGCGGAT AGCTCTCCGT
CCATGTATAG ACAGCCCCGA GGAGGCGGCA GCCGCGGCGA GGCTGGCCCT GTACATCGCC
GCAGCTACTA GAAATACCCA CGTGCTCCGC AGGTTTGCAG ACAGCGAAAG TAAAAATTTC
AAGGATATCC TAGAAAAAAC GCCTGGGATA CAAAGTCCAG AATGTAAGCT TGAAATAGCA
AGGGACCTAG GTATTGTTAC GAGGCAAGCC CAAGAAGTGG CGCCAGGCCT ATTGTCAGTG
GCCTACAAGA TGCCAATGGC CGTGAGGTGG ACTGCATATG TTCGCTACGC CCCCCAAGAT
CCGTACTGGG CTATGATAAA CCGGCCCGTC GTGAAGGGGT GGGTAATACT GCCAATTGAG
GATTTCGAGA GGTTGCTCGA GGAGGCGTAC GAGGAGCGGA TAGTTAGGAC TGTTGCTGAG
AACGAGCTTG CGGTGGGCAG AGTGGCCGCT TCGCTTGACC CCGCGCTGTT GGACGAGCTT
GTGAAGCAGT ACGGCCAGAG GCCTGTGCGG GTGGAGGCTA GGGCAATGCC GGGCCCTGAC
CCGCCCTGCA TGCGGGCGTT GATCGACGCG TTAAAGGCCG GCGAGAACCT CCCCCACACA
GGGAGGTTTG CCATAACTAC ATATTTGCTA CATAGGGGGT GGGATGTGGA GCAGATAGTT
GACCTCTTCA GAAACGCGCC CGACTTCAAC GAAAAGATCA CGAGGTACCA GGTACAGCAC
ATCGCCGGGC AGGCAGGGGG CAGGAAACAA TACTCGGTGC CCAGCTGTGA GACCATGAAC
TCTTGGGGCC TATGCCCCAC AAATCTCGGA TGCGGCATAA GAAACCCAGT AGTATATGGG
CGCAGAGTCG CGGCTAGAAA AAGTAGCTGA
 
Protein sequence
MTCDISLREF ACYFPFLNKS ASYLQKRGYL LDVALSDKKL LEKAVERLKR ALAHERIALR 
PCIDSPEEAA AAARLALYIA AATRNTHVLR RFADSESKNF KDILEKTPGI QSPECKLEIA
RDLGIVTRQA QEVAPGLLSV AYKMPMAVRW TAYVRYAPQD PYWAMINRPV VKGWVILPIE
DFERLLEEAY EERIVRTVAE NELAVGRVAA SLDPALLDEL VKQYGQRPVR VEARAMPGPD
PPCMRALIDA LKAGENLPHT GRFAITTYLL HRGWDVEQIV DLFRNAPDFN EKITRYQVQH
IAGQAGGRKQ YSVPSCETMN SWGLCPTNLG CGIRNPVVYG RRVAARKSS