Gene Information |
Locus tag | Pars_2086 |
Symbol | |
ID | 5054835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1862850 |
End bp | 1863821 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640469635 |
Product | peptidase M24 |
Protein accession | YP_001154284 |
Protein GI | 145592282 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
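The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it reproduces the 972 bp listed) and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1862850, End bp 1863821.
length_bp = gene_length(1862850, 1863821)
print(length_bp)                  # 972
print(protein_length(length_bp))  # 323
```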
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.165828 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATAG CCACCACGAC GGTGAACTTC GCCTACTTCA CCGGGGCCTG GGTGGAGACC TTCGAGCGCT TCAAGGCTGC TGTCAAGTGC GGGGACTACT CGGCGGCTGT TGTGCCCCAG CTTGACGCAG AGCGCGTAGG GGGCAACGTA TTTGCCTACA AAGACGGCGA GGACCCTGCA GAGGCTCTCC GAAAAGCAGC CTCTAGGTGC GACGCGTCGG TGGTATACGT CGATGGGGGC ACCACACTAC GCCACTTCGA AATTATAAAA AGGGCGCTTC CCAACGCCGA GTTTCGCCTA GCGGACGAAA TATTAAGGGA GTACAGGTCG GTGAAGAGGG GCGACGAAAT CGAGAAGATA AAGACGGCGG CAACCAAGAT CAGGAAGGTC CTAGAAGCTG TTGAGCTGAC ACCTGGGATA ACAGAGCGCG AGGTGGCTTT CAAGATCTAC TCCATGCTTT ACGAAGAGGG GCTCCACCCA GGGCCTATAC TTGTCCAATT CGGCCCTAAC ACTGCACTGC CACATCTAGA GCCCACGGAG AAGAAGCTTC ACCGAGACGA GGCTGTGGTG CTAGACATCT CTGCCTCCTA CCGTGGCTAC TACGGCGACT TGACAACGTC GTTCTTCTTC GGCGAGGCCC CGCCCCAGTA CGCGGAAATA TACAACACCG TAAAGGAGGC ACAAGCTACG GCGCTGGCCT CGGCAAAGCC CGGCGTCGGA GCCGCAGAGG TGGACAAAGC CGCCCGCGCA GTTATTGAGG CGAGAGGCTA CGGGCGCTAC TTCATACACC GCACGGGTCA CGGCCTCGGC CTTGAAATAC ACGAAGCTCC CGACATCTCT CCCAACTCGC CCGACGTGCT GAAGCCTGGG ATGGTCTTCA CAATAGAGCC CGGGATATAC CTGCCAGGGA AGTTCGGAGT AAGGCTGGAG ATAGACGTGG TGGTGGAAAA AGACGGCGCC CACCCCCTTT AG
|
Protein sequence | MIIATTTVNF AYFTGAWVET FERFKAAVKC GDYSAAVVPQ LDAERVGGNV FAYKDGEDPA EALRKAASRC DASVVYVDGG TTLRHFEIIK RALPNAEFRL ADEILREYRS VKRGDEIEKI KTAATKIRKV LEAVELTPGI TEREVAFKIY SMLYEEGLHP GPILVQFGPN TALPHLEPTE KKLHRDEAVV LDISASYRGY YGDLTTSFFF GEAPPQYAEI YNTVKEAQAT ALASAKPGVG AAEVDKAARA VIEARGYGRY FIHRTGHGLG LEIHEAPDIS PNSPDVLKPG MVFTIEPGIY LPGKFGVRLE IDVVVEKDGA HPL
|
| |
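The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the coding sequence. It relies on translation table 11 using the same amino-acid assignments as the standard code (the two tables differ in their permitted start codons). The helper names `ungroup`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene sequence are used as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def ungroup(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = ungroup(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = ungroup(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence from the record.
prefix = "ATGATTATAG CCACCACGAC GGTGAACTTC"
print(translate(prefix))  # MIIATTTVNF
```

The translated prefix matches the first ten residues of the protein sequence in the record. Running `gc_fraction` on the full gene sequence would allow a comparison with the listed GC content.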