Gene Information |
Locus tag | Pars_0382 |
Symbol | |
ID | 5055475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 330915 |
End bp | 332624 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640467949 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001152636 |
Protein GI | 145590634 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
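The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it reproduces the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values are copied from the record above; the function names are hypothetical.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(330915, 332624)
print(length)                           # 1710
print(expected_protein_length(length))  # 569
```

Both results agree with the Gene Length (1710 bp) and Protein Length (569 aa) rows.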
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0110228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCTGT TGGTGAGGAG GGTGCTTTCG GTTAGGTCTG CCACTGCTCC TAGGCTTGGG GCTGGGGGGT TGGTGTTTTA TCTCAGCGAC GTGACGGGGG TTCAGCAGTT GTGGTTTTTC GATGGGTCGC GGCACGACGT ATACGCGCCG GTTGAGGGCC GTGTTGGGGA CTACCGCGTC TCGAAAGACG GCGTGGTGGC CGTTGCGGTT GACAGGGACG GGGACGAGAA GTGGAGGCTG TACCTCCTGG GTGATGACCT CATGGAGGTC TCGGCTGAGG GCGTTAACAG CCTGGGGGCG TGGTCCCCCG ACGGGTCGGC GCTGGCCTTT ACAAGCACTA AGGACAGCCC ATCGGATTTC CACCTCTACG TCTACCGCCG CGGCGAGGGG GAAGTGGAGA GGCTGGCCGA GCTGGGGGGG ATAAACGTGG TGGAGGAGTG GTCCGAGGCG GGGATCTTCG TGACGCACTA CGAAACAAAC TTGGACAGTA CTATCTACCT ATTCCGAGAC GGCGAATTAA AGGAGCTTAC GAAACACAGC GGCGAGGCGC TTAACTTCTC CCCTCGCTAC GTGGGGGGTG GGAAGGCCCT CTTCTTGACA AATGCGGATT GGGAGTACGT GGGGGTTGCT CAGATGGACT TGGCGACCGG CTCCTGGAAG TACCTTGTGC AACTTGACAG AGACGTGGAG AGGTTCGACG TGTGGGGGAA CTACCTCGTG TTCTCGGTGA ATGAGGAGGG GCGCTCCGGC CTGTACCAGA TGCACATCCC ATCTGGCCTC ACGTACAAGC TACCGGCGCC AGCTGGGGTG GCGACGCACC TCGAGTACAG AGACGGGGTG GTGCTCTTCT CCCTGTCCGC CGTTAATAAG GGCCACGAGG TCTATGTATA CAAAGACGGG GCGGTGAGGC AGCTGACCCG CTCGCCCCGC TTCGGGGCGC CGCTTGAGCA GATCCCGGAG CCTCGCTCTG TGTGGTACCC CAGTTTCGAC GGGCGCAAGA TACAGGCCAA CATATACGCC CCTCCTGGCG AGCCTAGGGG CGTGGTGGTG TACCTCCACG GAGGGCCCGA GAGCCAGGAC CGGCCGGAGT TCAAGCCGCT AGTCGCCGCC ATGGTCTCCG CGGGTCTCCT CGTCGCGGCG CCTAATTACC GTGGCAGCAC AGGCTTCGGC AAGTCCTTCG TCCACCTAGA CGACGTGGAG AGGCGGTGGG ACGCCGTGAG GGACGTGGAG GTCTTCGCGA AGTGGTTGCA GGAGGAGGGA ATTGCGAGGG GGAGGCCGTG CGTCGCCGGT GGCTCATACG GCGGCTACCT CACCCTCATG GCCTTGGCCA CCGCGCCGGA TCTCTGGGCC TGCGGCGTGG AGATGGTGGG CATCTTCAAC TTGGTGTCTT TCTTGGAGAG GACTGCGGCC TGGCGGAGGC GGTACAGGGA GGCGGAGTAC GGCTCTCTCG ACAAGCAAAA AGACGTCCTC GTCCAGCTGA GCCCTGCCTC TCACGTGGAC AAGATCAGGG CCCCCCTCAT GGTGGTCCAC GGCGCGAATG ACATCAGGGT GCCTGTTTAC GAGGCTGAGC AACTGGTGCA GAGGCTGAGG GAGCTGGGGA GAGAGGCGAA GGCGCTTATC CTGCCCGACG AGGGTCACGT AATTACAAAG GTGGAAAACC GGGTGAAGGT ATACACGGAG GTGATTAAGT TTATTTTGCA ACATGTTTAA
|
Protein sequence | MELLVRRVLS VRSATAPRLG AGGLVFYLSD VTGVQQLWFF DGSRHDVYAP VEGRVGDYRV SKDGVVAVAV DRDGDEKWRL YLLGDDLMEV SAEGVNSLGA WSPDGSALAF TSTKDSPSDF HLYVYRRGEG EVERLAELGG INVVEEWSEA GIFVTHYETN LDSTIYLFRD GELKELTKHS GEALNFSPRY VGGGKALFLT NADWEYVGVA QMDLATGSWK YLVQLDRDVE RFDVWGNYLV FSVNEEGRSG LYQMHIPSGL TYKLPAPAGV ATHLEYRDGV VLFSLSAVNK GHEVYVYKDG AVRQLTRSPR FGAPLEQIPE PRSVWYPSFD GRKIQANIYA PPGEPRGVVV YLHGGPESQD RPEFKPLVAA MVSAGLLVAA PNYRGSTGFG KSFVHLDDVE RRWDAVRDVE VFAKWLQEEG IARGRPCVAG GSYGGYLTLM ALATAPDLWA CGVEMVGIFN LVSFLERTAA WRRRYREAEY GSLDKQKDVL VQLSPASHVD KIRAPLMVVH GANDIRVPVY EAEQLVQRLR ELGREAKALI LPDEGHVITK VENRVKVYTE VIKFILQHV
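As a rough illustration of how the two sequence rows relate, the sketch below strips the 10-base block spacing, computes GC content, and translates codons up to the first stop. It uses the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). The helper names are hypothetical, and only the first 30 bases and the last 9 bases of the gene sequence are used as input.

```python
# Standard codon table in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 until the first stop codon or end of input."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGAGCTGT TGGTGAGGAG GGTGCTTTCG"))  # MELLVRRVLS
print(translate("CATGTTTAA"))                         # HV
```

The first call reproduces the opening ten residues of the protein sequence and the second reproduces its final two residues before the TAA stop. Applying `gc_content` to the full gene sequence would be the way to compare against the reported 62%.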