Gene Pars_2323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2323 
Symbol 
ID5056336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2080652 
End bp2081791 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content51% 
IMG OID640469875 
ProductDNA-directed RNA polymerase subunit A'' 
Protein accessionYP_001154519 
Protein GI145592517 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02389] DNA-directed RNA polymerase, subunit A'' 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0147496 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000206974 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGATCTCGA AGCAGGAGCT TTTGGCAAAA ATACAACACG TACTGCCGAG GCCGCTGTAC 
GCTGAGATTG AAAACGCAGT GAAGGATCTA GACGATGAGA GGGCCCTCCG CCTCATATAC
CGCGTGCTGA GGCTTTACCT AAACTCGTTG ATAGACCCTG GGGAGGCCAT AGGTATTGTA
ACGGCGCAGT CGATAGGCGA GCCGGGCACC CAGATGATCC TCCGCTCTTT CCACTACGCG
GGTCTGAGGG AGTTCTCCAT GGCGCGTGGT CTGCCGAGGC TTATAGAGGT GGTTGACGCC
CGGAGGACTC CCTCAACGCC TTTGATGTAT ATCTATTTGA AGCCTCCGCA TAATAAGAGC
CGAGAGGCGG CGGAGGCTGT GGCAAAGAAG ATACAACAAG TAACTCTAGA GATGTTGGCA
AAGGAGGTGG ATGTAGATTA TATAAGTGGC GCCGTCACAA TTGAGCTGGA CACAGAGCAG
TTGAAGTATA GAGGTTTGAA CATAAAGGAG GTGGAGAAGA TTGTCAGCAA GGCGAGGGGG
AAGGACTTGT CAATCTCTTT CCGTGGCCAC ACAATTACAG TTACGCTTAC CTCGCCTGAT
ATTTTTAAGC TAAGAAAAGT CAGAGATAAG ATACTGCAGA TAAAAGTAGC TGGGATAAAG
GGAGTGAGGA AAGTGGTGCT CCAGTACGAC TCAAAAGCAG ATGAGTGGTT TATCGTAACT
GAGGGGACAA ACTTGGAGGC GGTGCTACAA CTTGAGGAGG TAGACGCCAC GAGAACCTAC
AGCAACGATC TCCACGAGGT GGAGGAGGTG TTAGGCATAG AGGCGGCCAG GGCTTTAGTG
GCACAAGAAA TCAAGAGAGT TCTTGACGAG CAGGGCCTAG ATGTTGATAT TAGGCATATG
TACATGGTGG CTGACACCAT GACTTGGTCA GGTAGGCTCA GGCCAATAGG ACGGCACGGA
GTTGTTGGGA CTAAGGAATC CCCCTTGGCG CGCGCCGCCT TTGAGGTCAC CGTCAAGACG
CTGATTGAGG CCTCTGTAAG AGGCGAAGAG GAGGCCTTTA AGGGAGTGGT CGAGAGCATA
ATTGCGGGGA AGTATATACC CATCGGTACC GGCATCGTGC GCCTTTTGAT GCAGTTCTAA
 
Protein sequence
MISKQELLAK IQHVLPRPLY AEIENAVKDL DDERALRLIY RVLRLYLNSL IDPGEAIGIV 
TAQSIGEPGT QMILRSFHYA GLREFSMARG LPRLIEVVDA RRTPSTPLMY IYLKPPHNKS
REAAEAVAKK IQQVTLEMLA KEVDVDYISG AVTIELDTEQ LKYRGLNIKE VEKIVSKARG
KDLSISFRGH TITVTLTSPD IFKLRKVRDK ILQIKVAGIK GVRKVVLQYD SKADEWFIVT
EGTNLEAVLQ LEEVDATRTY SNDLHEVEEV LGIEAARALV AQEIKRVLDE QGLDVDIRHM
YMVADTMTWS GRLRPIGRHG VVGTKESPLA RAAFEVTVKT LIEASVRGEE EAFKGVVESI
IAGKYIPIGT GIVRLLMQF