Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2323 |
Symbol | |
ID | 5056336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 2080652 |
End bp | 2081791 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640469875 |
Product | DNA-directed RNA polymerase subunit A'' |
Protein accession | YP_001154519 |
Protein GI | 145592517 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02389] DNA-directed RNA polymerase, subunit A'' |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0147496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00000206974 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATCTCGA AGCAGGAGCT TTTGGCAAAA ATACAACACG TACTGCCGAG GCCGCTGTAC GCTGAGATTG AAAACGCAGT GAAGGATCTA GACGATGAGA GGGCCCTCCG CCTCATATAC CGCGTGCTGA GGCTTTACCT AAACTCGTTG ATAGACCCTG GGGAGGCCAT AGGTATTGTA ACGGCGCAGT CGATAGGCGA GCCGGGCACC CAGATGATCC TCCGCTCTTT CCACTACGCG GGTCTGAGGG AGTTCTCCAT GGCGCGTGGT CTGCCGAGGC TTATAGAGGT GGTTGACGCC CGGAGGACTC CCTCAACGCC TTTGATGTAT ATCTATTTGA AGCCTCCGCA TAATAAGAGC CGAGAGGCGG CGGAGGCTGT GGCAAAGAAG ATACAACAAG TAACTCTAGA GATGTTGGCA AAGGAGGTGG ATGTAGATTA TATAAGTGGC GCCGTCACAA TTGAGCTGGA CACAGAGCAG TTGAAGTATA GAGGTTTGAA CATAAAGGAG GTGGAGAAGA TTGTCAGCAA GGCGAGGGGG AAGGACTTGT CAATCTCTTT CCGTGGCCAC ACAATTACAG TTACGCTTAC CTCGCCTGAT ATTTTTAAGC TAAGAAAAGT CAGAGATAAG ATACTGCAGA TAAAAGTAGC TGGGATAAAG GGAGTGAGGA AAGTGGTGCT CCAGTACGAC TCAAAAGCAG ATGAGTGGTT TATCGTAACT GAGGGGACAA ACTTGGAGGC GGTGCTACAA CTTGAGGAGG TAGACGCCAC GAGAACCTAC AGCAACGATC TCCACGAGGT GGAGGAGGTG TTAGGCATAG AGGCGGCCAG GGCTTTAGTG GCACAAGAAA TCAAGAGAGT TCTTGACGAG CAGGGCCTAG ATGTTGATAT TAGGCATATG TACATGGTGG CTGACACCAT GACTTGGTCA GGTAGGCTCA GGCCAATAGG ACGGCACGGA GTTGTTGGGA CTAAGGAATC CCCCTTGGCG CGCGCCGCCT TTGAGGTCAC CGTCAAGACG CTGATTGAGG CCTCTGTAAG AGGCGAAGAG GAGGCCTTTA AGGGAGTGGT CGAGAGCATA ATTGCGGGGA AGTATATACC CATCGGTACC GGCATCGTGC GCCTTTTGAT GCAGTTCTAA
|
Protein sequence | MISKQELLAK IQHVLPRPLY AEIENAVKDL DDERALRLIY RVLRLYLNSL IDPGEAIGIV TAQSIGEPGT QMILRSFHYA GLREFSMARG LPRLIEVVDA RRTPSTPLMY IYLKPPHNKS REAAEAVAKK IQQVTLEMLA KEVDVDYISG AVTIELDTEQ LKYRGLNIKE VEKIVSKARG KDLSISFRGH TITVTLTSPD IFKLRKVRDK ILQIKVAGIK GVRKVVLQYD SKADEWFIVT EGTNLEAVLQ LEEVDATRTY SNDLHEVEEV LGIEAARALV AQEIKRVLDE QGLDVDIRHM YMVADTMTWS GRLRPIGRHG VVGTKESPLA RAAFEVTVKT LIEASVRGEE EAFKGVVESI IAGKYIPIGT GIVRLLMQF
|
| |