Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1108 |
Symbol | |
ID | 5055970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 994887 |
End bp | 996491 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640468664 |
Product | AAA ATPase |
Protein accession | YP_001153338 |
Protein GI | 145591336 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00107728 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATGTG GGTTCGTATC GCGGCAGTTC CCGGCTGAGG TTTCTTTCGA CGAGGCTAAG ACCTACGCCG TCTTGACTGG TTGCGGCTTG GAGGTGGGGT CCTACGTGGT GGTTGAGGGG GGTGGGAGGA GGTACCTGGC CTCGGTGTCT AAGCTCCAGG TGGCGGACAT ATACGCCGTG GCGAAGACGC CGGTGCTGAC GCCGGAGCAG GAGAAGGCGG TTTCCCTGAG GCTGGGCCCC TTGATTGCCG AGCTGGAGAT CCTCTCGGAG TGTTTCGGCA ACTCGTGCGG CCCGCCCGCG ACGCCGGTGC CCATCCACTC CTCTGTGAGG CCTCCTGCCC CTGGGGAGGT GGGGGAGATG CTGGGCCTGC CGCGGGAGGG GGTTGTCCTC GGCGCGTTGG CCCTGCCCTC GGGTCGGGCG GTGGAGGGGG AGGTGGTTAG GCTCCCGCTG GAGGCGCTTA GGCACCACGT CTTGGTGGTG GGGACTACTG GGAGTGGGAA GACGGTGTTG GTGAAGGAGA TTGCAAAGCA GCTGGCGGGT CAGCAGGTGG TTGCCCTAGA CGCCGTGGGC CACTTCTACC ACTTGGCGTA CAGCGGGGTG AGGGTGAGGG TGGTGCTCCC CGTCACTAGG CAGATGGCGC GTCGCGGGGC GAGGGCGGTG GTGAGGAGGG TGGTGAAGAC GGCGGCGTGG GGGAGCAGGG CCCGCTTCAA GGCGAGGGTT AGGTTCAAGA AGCAACGGGC TACTGGGGAG GTGTACCTGG CGGGGGCGGT GGTGGAGGTT GAGTCGCCCC GCGGCCGGGG GGTTTTCGAG GTGGTTCCCT GGGCGCTGGA GAGCAGGCGT ATTTTGAGGG ACCTCCCCCG GGCTATCCCC ATCCTCTCGC AACAGGCTAG GATTTTCTAC CAGAGGGTTT TGAAGAGGGC GGTGGAGCTC AGTGGGCAGA AGACGGCGGA GGGGCTTTAC GAGTTTTTGA CCTCGCCGGC TGAGGGGGAG AGGAGGCCGG TGGTGATGTA CGAGAAGCTG GGCATGGAGC TGGGGCTTCA CTCCAGCACT ATGGAGAACA TCGTGAGGGC TCTCCTGGCG CTGGTGGAGA CGGGGCTTGT GGACGTAGTG GGGGGCGGGT TCAGGGTGGT GGAGCCGCGG TACGACTTCT CGGGGTACAC GGTGGTGGAC ATATCTCGCC TCAACGTGCA CCAGCAGAGG CTTGTGGTGT ACCGCATCTT AGACGCGGTG TATAGGAGGG CGCGGCCGAC CACCGCCGTC CTCATCGACG AGGCCCACCT CTTCTTCCCC CAGACCCGGA GCGAGGACGA GCAGGCCTTT ATAGAGGGGC ACTTGACGCG GCTTACGAGG CTTGGGAGGG CGAGGGGCAT CGCAGTGGTC TTCGCCACCC ACATGCCCAC CGACCTAAAC GACGTGGTGG TGCAGCTGGC TAACACCAAG GTTATCCTCC GTAGCGACCT CAAGATTTTG GACAGGCTGG ACGTCCCCGC GAAGGACAGG CGGTTCCTCG CCGTGGCGGA CAAGGGCATC GCCTACGTCA GGAGCTACGC CTATAGGCAC CCTATATACG TCAAGGTGCA GAAGACCGTT GCCCACTTCG GCTGA
|
Protein sequence | MTCGFVSRQF PAEVSFDEAK TYAVLTGCGL EVGSYVVVEG GGRRYLASVS KLQVADIYAV AKTPVLTPEQ EKAVSLRLGP LIAELEILSE CFGNSCGPPA TPVPIHSSVR PPAPGEVGEM LGLPREGVVL GALALPSGRA VEGEVVRLPL EALRHHVLVV GTTGSGKTVL VKEIAKQLAG QQVVALDAVG HFYHLAYSGV RVRVVLPVTR QMARRGARAV VRRVVKTAAW GSRARFKARV RFKKQRATGE VYLAGAVVEV ESPRGRGVFE VVPWALESRR ILRDLPRAIP ILSQQARIFY QRVLKRAVEL SGQKTAEGLY EFLTSPAEGE RRPVVMYEKL GMELGLHSST MENIVRALLA LVETGLVDVV GGGFRVVEPR YDFSGYTVVD ISRLNVHQQR LVVYRILDAV YRRARPTTAV LIDEAHLFFP QTRSEDEQAF IEGHLTRLTR LGRARGIAVV FATHMPTDLN DVVVQLANTK VILRSDLKIL DRLDVPAKDR RFLAVADKGI AYVRSYAYRH PIYVKVQKTV AHFG
|
| |