Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2322 |
Symbol | |
ID | 5054559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 2078001 |
End bp | 2080655 |
Gene Length | 2655 bp |
Protein Length | 884 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640469874 |
Product | DNA-directed RNA polymerase subunit A' |
Protein accession | YP_001154518 |
Protein GI | 145592516 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02390] DNA-directed RNA polymerase subunit A' |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0946262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0000535013 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCGTTAA AGGAGGAACT TGACACTATT CCTCGTAAGG TGATTAAATC TATAAAATTC GGAGTCCTTA GCCCAGAGGT TATTCGAAAG TACTCGGTGA TGGAGGTGAC AACATCGGAA GTGTACGATG AAGGCGGCTT ACCGGTGAGG GGTGGTATTT CTGACCGGAG GCTGGGGGTG GCCGAGCCCG GGGCACGTTG CGAGACGTGT GGTCAAACTC ACGATGTTTG TCCGGGCCAT TTTGGACACA TAGAGCTGGT TAAGCCTGTG GTTCACGTTG GTTTTGCCCG TGTGATATAT GACATTTTGA GAACTACTTG TCCCAACTGC GGGCGCATAA TGCTCCGCGA CGAGGAGATT GCGCGGTATA GGGAGAGGCT GACTAGGCTT AGTAAGAGGT GGCGTCTGCT TGCGCAGAAC CTGCACGAGA GAATTAGGAG GAAGGCGGCG GAGCGTATGA CTTGTCCCCA CTGTGGGTAT AAGCGGAATA AGGTAAGGTT TGAGCGGCCG TACTACTTCT ACGAGGAGAC TGAGAATGGC GCCTTGGTTA AGCTTGATCC CGAGATGCTG AGAGATAGGC TTAGTAAGAT ACCGAGTGAA GACTTGGAGC TTCTTGGGAT CAACCCTTCT GTTTTCCGGC CCGAGTGGGC AATTCTTAAG GTGTTGCCCG TTCCGCCTCC GCATGTGAGA CCTTCTATAC AGCTGGAGAC CGGTATTAGG TCGGAGGACG ACTTAACTCA CAAGCTTGTG GATATTATTA GGATGAACGA GAAGCTTAAA ATCGCCATTG AGACCGGCGC TCCTACTAAC GTGGTGGACA ACCTCTGGGA CCTCCTCCAG TACCACGTCG CTACTTACTT TGACAACGAG CTCCCGGGGA TCCCGGTGGC TAAGCACAGG GGCGGGAGGC CGCTTAAGGG GATAGCCCAG CGCCTAAAGG GCAAGGAGGG GCGCTTCAGG GGATCTCTCA GCGGGAAGCG CGTGAACTTC TCGGCGCGTA CGGTCATCAG CCCCGACCCC CACATCAGCA TAAACGAGGT TGGAGTGCCT ACTGATATCG CCAAGATTTT GACTGTGCCG GAGAAGGTGA CTGCTTGGAA TATAGACGTG CTACGGGAGT ACGTGATCCG CGGCCCCGAG ACTTGGCCAG GGGCGAACTA CGTCGTGACG CCCGAGGGGA GGAGGATAGA TCTACGCTAT GTGAAGGACA GGAAGGCCCT CGCGGAGAGG CTGGCCCCGG GTTGGGTGGT AGAGAGGCAC TTAAGAGACG GCGACATTGT TCTTTTCAAC AGACAGCCTT CTCTTCACAG GGTGTCTATG ATGGGCCACT TGGTCAAGGT GTTGCCGGGG AGGACGTTTA GATTGCACCT GGCCGTGTGT CCTCCGTATA ACGCTGATTT TGACGGGGAC GAGATGAACC TCCATGTGCC GCAGACGGAG GAGGCTAGGG CGGAGGCGAG GTTGTTAATG CTGGTGGAGA ATCACATAAT CACTCCGCGC TACGGCGGTG CCATTATCGG AGCGAGACAA GACTACATAA TCGGGGCCTA TCTGCTTTCT CACAAAACTA CTTTCTTAAC TAAGAAGGAG GTGGCCTTCC TCCTGGGCGC GGGTAAGTCG GAGGAGGATC CACCGGAGCC GGCGATACTC TACCCCGTAG AGCTGTGGAC GGGTAAGCAG ATTATCTCCC ACTTCCTACC GAAGGATTTC AACTGGGTGC AACCCACCGC TTTTAAGTCG AAGTGCCAAG ACGCCTATAC TTGCTATGGC GATGAGTGGA TTATCGTGCT GAACGGCTAC TTAGCAAAGG GGGTGTTGGA CAAAAAATCG ATAGGCGCCG AGCAGGTGGA CTCTCTCTGG CACCGCATTG CCCGAGACTA CCCGCCGGAT GTGGCGAGGA GGTGGCTTGA CTCGTCTCTC CGGTTGTTCC TAAGGTATCT TGATCTACGC GGCTTCACCT TTGCCATGGA CTCAGTCTAT ATACCTACGG AGGCGTATAG GGAGGTGGAA GAAGTCATAG AGCAGGCCTT GAAGAAGGTT GAAGGACTTA TCGAGGATTT CACAAGCGGG CGTCTTGAGG CCATGCCCGG CTTTACGGTG GAGGAGACGT TTGAAAACAA GGTTACAGAT ATCTTGTCAA GAGTTCGCGA AGACGCGGCG CAGGTTGTGG AGAAGTATAT TGATAAGAAC TCTGAGGGCT ACCTAATGGC TAAGACCGGC GCTAGGGGTA GTCTCGTTAA CATAGTGCAG ATGGTCGCCA CGCTGGGCCA GCAGACTATC CGGGGCGAGC GCATTAGGCG GGGCTTTAGA AGTAGGACGC TTCCTCACTT CCCTGTGGGG GATATAGGCG CCTTTTCTGG GGGCTTTGTT AAACATTGCT TTAGGTGCGG CCTCACGCCT GTTGAGTATT TCTTCCACGC GGCGGCGGGT AGAGATGGGT TGATAGACAC GGCGGTGCGC ACGGCGCAGT CAGGCTACAT GCAGAGGCGT CTTATCAACG CGTTGCAAGA CGTCTACGTG GCCTACGATG GGACCGTGAG GTTTGGCGGC TCTATGCTAC TCCAGCCCCT ATACGGCGAA GACGGCGTTG ATGTGAGCCG TTCAGACCAC GGCAAGGTCA CAGACATAAA GCTACTCAAG ATGTGGATAA GATGA
|
Protein sequence | MSLKEELDTI PRKVIKSIKF GVLSPEVIRK YSVMEVTTSE VYDEGGLPVR GGISDRRLGV AEPGARCETC GQTHDVCPGH FGHIELVKPV VHVGFARVIY DILRTTCPNC GRIMLRDEEI ARYRERLTRL SKRWRLLAQN LHERIRRKAA ERMTCPHCGY KRNKVRFERP YYFYEETENG ALVKLDPEML RDRLSKIPSE DLELLGINPS VFRPEWAILK VLPVPPPHVR PSIQLETGIR SEDDLTHKLV DIIRMNEKLK IAIETGAPTN VVDNLWDLLQ YHVATYFDNE LPGIPVAKHR GGRPLKGIAQ RLKGKEGRFR GSLSGKRVNF SARTVISPDP HISINEVGVP TDIAKILTVP EKVTAWNIDV LREYVIRGPE TWPGANYVVT PEGRRIDLRY VKDRKALAER LAPGWVVERH LRDGDIVLFN RQPSLHRVSM MGHLVKVLPG RTFRLHLAVC PPYNADFDGD EMNLHVPQTE EARAEARLLM LVENHIITPR YGGAIIGARQ DYIIGAYLLS HKTTFLTKKE VAFLLGAGKS EEDPPEPAIL YPVELWTGKQ IISHFLPKDF NWVQPTAFKS KCQDAYTCYG DEWIIVLNGY LAKGVLDKKS IGAEQVDSLW HRIARDYPPD VARRWLDSSL RLFLRYLDLR GFTFAMDSVY IPTEAYREVE EVIEQALKKV EGLIEDFTSG RLEAMPGFTV EETFENKVTD ILSRVREDAA QVVEKYIDKN SEGYLMAKTG ARGSLVNIVQ MVATLGQQTI RGERIRRGFR SRTLPHFPVG DIGAFSGGFV KHCFRCGLTP VEYFFHAAAG RDGLIDTAVR TAQSGYMQRR LINALQDVYV AYDGTVRFGG SMLLQPLYGE DGVDVSRSDH GKVTDIKLLK MWIR
|
| |