Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1361 |
Symbol | |
ID | 5054181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1224088 |
End bp | 1225659 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640468907 |
Product | hypothetical protein |
Protein accession | YP_001153576 |
Protein GI | 145591574 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.26972 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0191854 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTAAGA TCGGTGTAGT TGTCAAATCG CCGTCTCTGT ACTACTACGT CTTTAAGCCG TTTAGAGGCG TTGAACTTGA CGTAGGCTCT TTTGTGGCGA CGGAGATTGA TGGAGTTAGA GTCATTTCTA GGGTAGTAGC CATACGGCAT AGGAACGCCG TGGTGGACCC GCGCCTAATT GCTCACTTCG ACGAAGAGTC TACAGTAAAG GAGATTAAAG AAACTCTTGG AATAGAAGAG GCGCTGTACT ACACAGAGGC CAAAGCCGTG GTGCTCGGCG CCAGGAGCGG CCGGAAAATC CTCAAACCGC AGAAGCCCGT CAAGCCTCTG AGCTACGTGT ACAGCACCAC ACCCGAGGAA CTTGAACAAT TCTTCGCGCC TGGAGAAGAA GGAACATATA TACCAATAGG AAAGATAAAA GGCACTTCTA TTCCTGCATA CATAGACGCT GAGAGGCTAG TCACGCACCA CTGCGCAATT CTAGCCAGCA CCGGCGCCGG GAAGAGCTAC CTCGCCGGTG TGATAGTGGA GAGGCTCTCG GCGTTGGACA TCCCAATAGT AGTGATAGAC CCCCATGGAG AGTACTCCTC TATGGCGGTG CCGGCCACAG AAGAGGGCAA GCACGTATCG GAAAAGGTGA GGATTTTCGT CGTGGGCAAA ACAGACGTCA CTCACCTCGA CCAAGCCTTT AAGAAACGCT ACGGCATCCC CCGCACATAC ACGAGAATCG GCCTAAACCC ACGTAGCATT CCCCTACGCA CCCTAGAAAA GATCCTAGAC CTACTATACG GCCTTACGGA CGCACAACGA CGAATACTTG AAGAGGGGTG GCAAAGCGCC ACCAGCTACG GCGAGCGACA ACCTCTCACA TCCGTAGAAG AGCTAATAAA AGAAGTCCTA GAAGGCGGCA AACACGCAGC CCCGCCCGGC TTTGCGGGGG AGATGTCACT AAGAGGACTT GAAGGACGTC TAAGAGCCCT CTTCTACACA AGCCCCGTAT TCATTACGAG ATACGGCGAG ACGTATCAAG GAGAACCAAT CAAGCTAATA GACCCCGAGA TGTACCTCAC CACTCCGTCA ATACACATCT TCGACATTTC CGGCCTCGAC ATCCTCGACC AGCAACTCTT CCTCGCCGTA CTCCTAGACC AGCTCTACAG AGTATCCACA CTGAGGAAAA ACCTCACAAC ACTCCTCATA ATCGAAGAAG CCCACAACTA CGCCCCGGCG GCCGGCACTT CAGTAGCCAA AAGCTACATA GCAAAAATAG CAAGAGAAGG CAGGAAATTC GGCCTGGGGC TGTGCCTCAT CACCCAACGG CCTACAAAGT TGGACCCAGA CGTCGTATCC CAGGCAATGA CCCAGATATT CAAAAGAATG ATAAACCCCC ACGACTTGCG ATACGTAGCC ACAGTCGCGG AGCACCTAGA CGACCCTAGG CCGCTGAGAA CCCTAGACGA GGCAGAAGCA GTAGTAACAG GAATCTCAGT CCCAGTGCCG CTAATGATAG TAGTTGACCA AAGGTGGACG CAACACGGCG GAGTAACCCC AAGCATAAGA AGACAAGTAT AA
|
Protein sequence | MAKIGVVVKS PSLYYYVFKP FRGVELDVGS FVATEIDGVR VISRVVAIRH RNAVVDPRLI AHFDEESTVK EIKETLGIEE ALYYTEAKAV VLGARSGRKI LKPQKPVKPL SYVYSTTPEE LEQFFAPGEE GTYIPIGKIK GTSIPAYIDA ERLVTHHCAI LASTGAGKSY LAGVIVERLS ALDIPIVVID PHGEYSSMAV PATEEGKHVS EKVRIFVVGK TDVTHLDQAF KKRYGIPRTY TRIGLNPRSI PLRTLEKILD LLYGLTDAQR RILEEGWQSA TSYGERQPLT SVEELIKEVL EGGKHAAPPG FAGEMSLRGL EGRLRALFYT SPVFITRYGE TYQGEPIKLI DPEMYLTTPS IHIFDISGLD ILDQQLFLAV LLDQLYRVST LRKNLTTLLI IEEAHNYAPA AGTSVAKSYI AKIAREGRKF GLGLCLITQR PTKLDPDVVS QAMTQIFKRM INPHDLRYVA TVAEHLDDPR PLRTLDEAEA VVTGISVPVP LMIVVDQRWT QHGGVTPSIR RQV
|
| |