Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0197 |
Symbol | |
ID | 5055204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 175951 |
End bp | 177828 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640467776 |
Product | hypothetical protein |
Protein accession | YP_001152464 |
Protein GI | 145590462 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.317816 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAC TCCTAATCAC GTTAGCACTG GCAACACTAG CTTTGGCGGC CACCACTGTG GTAATACCCC CAATGGCTAA GCTTGAGCAA ATTGCTTACA AAGTTGAGGA GACTACCTTG AAGATACAGG GAGCTGGCTT CGCCACGCTT GCCAAGCCGT ACGTCACTCC TGGTGAGGGC TACGTCTACG CCGGCATGAG GATTGAGTTC CTGGGCGCCT ACCCCTCTAT CCAGGTCGGG GCAGACGGCC AGCTCAGCAA GACCTTCGAT CAGAACGGCT TCGTGTCGAC CGTCTACGTC GGCCCCGACG CCTCTAAGGT GACGCTCGTC AACACGGCCA AGGAGCCCGT CGAGGTGAAG GTGAGGATCA CATACACCTA CGTCAAGGCC TCCTACATCT CGCTGAGCGG CGATGCTGTG GTGGAGGTAA ACGTGCCTGA CGGCAAGCTG GCCCAGGGCT TCAACGCAAT GGCGAGGCTC ACCATAGAGC CCTATGCCCC CTTCGTGGTG AAGGCGGTGG AGAGGCCAGA CGGCACCCCG GCCACAGTGT ACAGGGTGGA GCCCAAGGTT GTTGAGATAA ACACCCCGGG CAAGTACAAG ATAACGATCA CCCAGGGCGC CGCCCTTCCG GCGGCGATGC TGGTGAAGAG CCTCTCTAAG CAGACGGCAA CCGTCACAGC CGGCGGCGAG TTTGCAGTGA CCGGGGCAGA GGTGGGAGTC CCCCAGGGCT GGAAGTTGCT GGGCTATGCG GTGTTTGCCT ACACCGCTGA CGCCAACTTA ATAGGCAAGG AGGTCACTGG CGATATAAAG ATAGACGGCG GCTTGGTGGA CACTATCACT GACGTGAACC AAAACATCAT CGTGAGGAGT GTCAGCTATC TGGTGCCTCC TGTCTGGAAC TTCAATATTA GGTACAAGAT AGCGCTCGTA TACGGCGAGC AGTTCAAGGT CTCCACCACT CTGCCCAGCA CAGTTAATGT GATTTACATC CCGCTGGTGT ACAGAGAGGC ACAAGTCAAG TGGTTGCCCG ACCGCGCCCT GGTTAACGTC ACTGATGTTG ACGTGGCGGA CGGCCAGTGG ACAGCTGTGG TGTTGCAGCT ACCCGAGCTG GCCAAGATAG TGTCGATACG CACTCCGGGC AACGCGATGA TCTCCAACGC CACCGACGTC AGGCTGGTGT GGGGCGGCGG CCTTAGGGCA GTCTCAATCT CGCCAGACGG GAGGCAGGCA TACATAATCG CCCAACTCGG CGACACGAAG GAGACCGGCA TGTACACTTT CATGATAAAC TGGAAGCCCA TGCGGATCCC CGTCATTGAC ACTAAGGGCA GAGCCGTGGG CGACCTCTCA GCCTCTGCTG ACAAGTTTGA CGCCTCCGCC TCAGTTGGAT ACGTCGAGGT GAAGGTGTAC AAGCCCGAGC CCTTTGCCCT CGACATAAGC TACAAGGGCA TCCCGGCGGC CCACGTGGAG GTGAACTCCC TGGTGGAGAA GCCACAGGCC GTGACCCTCG GCATATACAC AGTCAAGGTT GTGGTGGTCG GGGCGTTGAA CCAGCCCATA GCCCAAGCCT CCGTGTCACT TGAGGGCTTC CCGGCCTCTG GAAAGACGGA CGGAGCTGGG TCGCTGGTGT TCCAAGATGT GTTGGAGGGA ACTTACAAGA TAAATGTTGA CATCGGTGGG AGGGTCAAGG TAAGCGAGGT CATAGAGGTG AGGGGCGACA CCGAGAAGAT AGTCAAGACG CCTGTGGTTG CGATAGTGGG CGGCGTGCCG ATAACCACTC TCGATGCCAT AGCCACGGCA GGTGGCTTGT CCGCGGCTGG GCTATACTTC GCGTTGACGA GAAGGAAGGA GTCAGTCGCC GAGGTAGAAC AGATATAA
|
Protein sequence | MNKLLITLAL ATLALAATTV VIPPMAKLEQ IAYKVEETTL KIQGAGFATL AKPYVTPGEG YVYAGMRIEF LGAYPSIQVG ADGQLSKTFD QNGFVSTVYV GPDASKVTLV NTAKEPVEVK VRITYTYVKA SYISLSGDAV VEVNVPDGKL AQGFNAMARL TIEPYAPFVV KAVERPDGTP ATVYRVEPKV VEINTPGKYK ITITQGAALP AAMLVKSLSK QTATVTAGGE FAVTGAEVGV PQGWKLLGYA VFAYTADANL IGKEVTGDIK IDGGLVDTIT DVNQNIIVRS VSYLVPPVWN FNIRYKIALV YGEQFKVSTT LPSTVNVIYI PLVYREAQVK WLPDRALVNV TDVDVADGQW TAVVLQLPEL AKIVSIRTPG NAMISNATDV RLVWGGGLRA VSISPDGRQA YIIAQLGDTK ETGMYTFMIN WKPMRIPVID TKGRAVGDLS ASADKFDASA SVGYVEVKVY KPEPFALDIS YKGIPAAHVE VNSLVEKPQA VTLGIYTVKV VVVGALNQPI AQASVSLEGF PASGKTDGAG SLVFQDVLEG TYKINVDIGG RVKVSEVIEV RGDTEKIVKT PVVAIVGGVP ITTLDAIATA GGLSAAGLYF ALTRRKESVA EVEQI
|
| |