Gene Information |
Locus tag | Pars_0732 |
Symbol | |
ID | 5054678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 648284 |
End bp | 650164 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640468289 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001152970 |
Protein GI | 145590968 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
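The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the gene length includes the terminal stop codon. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the final codon
    is a stop codon, which is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Pars_0732
gene_bp, protein_aa = cds_lengths(648284, 650164)
print(gene_bp, protein_aa)
```

With the values from this record the sketch yields 1881 bp and 626 aa, matching the Gene Length and Protein Length fields.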
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.517111 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATAG GGCCTGAGGA TTTGGGGAAG CTCGTCCTAG TCTCGGGGCC GGAGGCGGGC GGCGGGAGGA TCCTCTTCAC AGTCACGAGG ATCAGCTTGG AGAACGACCG CTATGAGTCC AGCGTATGGG CGTGCGATGG GGGTTGCAGG GCGGTGCTGC CTGGCCCGTT CGACGTCTCG GCTAAGCCGT CGCCAGACGG CTCCAGGATC GCCCTCCTTT CCAGGAGGGG CTTCGGGGAG AAGGAGAAGG GAGTTGGCCT ATGGGTCGCG GAGTGGGGCG GGGAGCCGCG GTTGCTTGCT AAGTTCCTCG GCGTGTTGGA CTACGCGTGG GCCCCATCGG GAGAGGCCCT CGCAGTGGTG GCCTACGAGG GGTCTCCAGA GGCTGACGTG AAGCACGCGG AGAGGCTCCC CCTGTGGATA AACGACTTCG GCTTTGCCTA CAACGTGTCT AGCCATCTCT ACCTAGTCGA CGCCTACAGC GGCGTCGTTG AGAAGCTGAC GGAAGGCGAT GTTCGGGTGC TAAAAGCGGC TTTCAGCCCA GACGGGAGGA GGGTGGCGTA CGCAGTGGCC AGAGACTGGC TTAGGCCGTA TCTCCAAGAC GTCGTTGTTC TGGATTTAAA AAGTGGGGAG CGGGCTACCT GGGCGTCTGG CTACACGTCT GTTACTGAGA TCGCTTGGCA CCCCAACGGC CGGGGCTTGG CCTTCACGGG CCACCTAAGG CCGAGGGGCT TTTCCTCCCA CTCCAGGGTG TGGGTAGTGG AGGAGGGGGG AGAGCCCCGT TGCCTAACCT GTAGCTTTAA ATACGGCGTG GGCAACGGCG TCAACAGCGA CGCGAGGGGG CCTAGCTATA CAAGATCCCT GTACTGGGAC GGCGGCTCAG TCCTCTTCCA AGCCACCGTA GGGGGCTCCG TCGGCGTGTA CCGCGCCTCT CTGAGCGGCG AGGTGGAGGC CGTGTTGGCC CCAAGAGGCG TCGTTGATGA GTTCGTCCCA GTAGGTAGGG ACATATTGTA CACATACATG GAGGCTGACA AGCCAAAGGA GCTTTACCTG TGGGACGGGT CGGAGGCGAA GAAGCTTACG CGTTTCAACG ACTTCGTGCT GGAGAGGTGG CGCTTGAAGA GGCCCCAGAG GTTTGTAATG AAGGCAAGCG ACGGGGCGGA GGTGGAGGGC TGGGTCCTCC TGCCGGAGGG CGCTGGACCG CATAAGTGGG TTCTCTACAT ACACGGCGGG CCGAAGACGG CGTACGGCGA GGGGTTCATG TTTGAGTTCC ACCTCCTAGC CTCGCGGGGC TACGCCGTGG TGTTCTCGAA CCCCAGGGGG AGCGACGGCT ACGACGAGGA GTTCGCCGAC ATTAGGTGCA GATACGGCGA GAGGGATTTC CAAGACTTAA TGGAGGTGGC GGACTACGCG GTGAGGAACT TCCCGCTTGA CCCCCAGAAG GCGGCGGTGG CGGGGGGCTC ATACGGCGGG TTTATGACAA ACTGGATAAT TACGCGGGTG GACAAGTTCA AGGCGGCTGT GACCCAGCGC TCTATCTGCG ACTGGGTCTC CATGTACGGC ACGACTGACA TCGGGTGGTA CTTCGTGGAG GACCAGCTGT GTTGCACGCC GTGGAGGGAC AGAGAGCGTT GCATCGAGAA GAGCCCGCTG TACTACGCTG ACAGGGTGAA GACTCCCACG CTCATCATAC ACTCCATGGA GGACTACCGG ACGTGGCTCG ACCAAGGCGT GCTCTTCTTC ACCGCGCTGA GGTTACACGG CGTGGAGGCG AGGCTGGCCA TTTTCCCCGA GGAGAGCCAT 
GAGCTCACCA GAAAGGGGAA GCCCAGGCAC CGGGTGGAAA ACTTTAAGGA GATCTTGAGC TGGCTTGATA AACATGTATA A
|
Protein sequence | MAIGPEDLGK LVLVSGPEAG GGRILFTVTR ISLENDRYES SVWACDGGCR AVLPGPFDVS AKPSPDGSRI ALLSRRGFGE KEKGVGLWVA EWGGEPRLLA KFLGVLDYAW APSGEALAVV AYEGSPEADV KHAERLPLWI NDFGFAYNVS SHLYLVDAYS GVVEKLTEGD VRVLKAAFSP DGRRVAYAVA RDWLRPYLQD VVVLDLKSGE RATWASGYTS VTEIAWHPNG RGLAFTGHLR PRGFSSHSRV WVVEEGGEPR CLTCSFKYGV GNGVNSDARG PSYTRSLYWD GGSVLFQATV GGSVGVYRAS LSGEVEAVLA PRGVVDEFVP VGRDILYTYM EADKPKELYL WDGSEAKKLT RFNDFVLERW RLKRPQRFVM KASDGAEVEG WVLLPEGAGP HKWVLYIHGG PKTAYGEGFM FEFHLLASRG YAVVFSNPRG SDGYDEEFAD IRCRYGERDF QDLMEVADYA VRNFPLDPQK AAVAGGSYGG FMTNWIITRV DKFKAAVTQR SICDWVSMYG TTDIGWYFVE DQLCCTPWRD RERCIEKSPL YYADRVKTPT LIIHSMEDYR TWLDQGVLFF TALRLHGVEA RLAIFPEESH ELTRKGKPRH RVENFKEILS WLDKHV
|
| |
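The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TAA even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for all non-start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 uses the same
# amino acid assignments as the standard code (it differs in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into 10-mers."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last stretches of the gene sequence from the record
print(translate("ATGGCTATAG GGCCTGAGGA TTTGGGGAAG"))
print(translate("TGGCTTGATA AACATGTATA A"))
```

The two fragments translate to MAIGPEDLGK and WLDKHV, the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.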