Gene Pars_0382 details


Gene Information

Locus tag: Pars_0382
Symbol:
ID: 5055475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 330915
End bp: 332624
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 62%
IMG OID: 640467949
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001152636
Protein GI: 145590634
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
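
The start, end, and length fields above agree with one another, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 330915
end_bp = 332624

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1710 569
```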


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0110228
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCTGT TGGTGAGGAG GGTGCTTTCG GTTAGGTCTG CCACTGCTCC TAGGCTTGGG 
GCTGGGGGGT TGGTGTTTTA TCTCAGCGAC GTGACGGGGG TTCAGCAGTT GTGGTTTTTC
GATGGGTCGC GGCACGACGT ATACGCGCCG GTTGAGGGCC GTGTTGGGGA CTACCGCGTC
TCGAAAGACG GCGTGGTGGC CGTTGCGGTT GACAGGGACG GGGACGAGAA GTGGAGGCTG
TACCTCCTGG GTGATGACCT CATGGAGGTC TCGGCTGAGG GCGTTAACAG CCTGGGGGCG
TGGTCCCCCG ACGGGTCGGC GCTGGCCTTT ACAAGCACTA AGGACAGCCC ATCGGATTTC
CACCTCTACG TCTACCGCCG CGGCGAGGGG GAAGTGGAGA GGCTGGCCGA GCTGGGGGGG
ATAAACGTGG TGGAGGAGTG GTCCGAGGCG GGGATCTTCG TGACGCACTA CGAAACAAAC
TTGGACAGTA CTATCTACCT ATTCCGAGAC GGCGAATTAA AGGAGCTTAC GAAACACAGC
GGCGAGGCGC TTAACTTCTC CCCTCGCTAC GTGGGGGGTG GGAAGGCCCT CTTCTTGACA
AATGCGGATT GGGAGTACGT GGGGGTTGCT CAGATGGACT TGGCGACCGG CTCCTGGAAG
TACCTTGTGC AACTTGACAG AGACGTGGAG AGGTTCGACG TGTGGGGGAA CTACCTCGTG
TTCTCGGTGA ATGAGGAGGG GCGCTCCGGC CTGTACCAGA TGCACATCCC ATCTGGCCTC
ACGTACAAGC TACCGGCGCC AGCTGGGGTG GCGACGCACC TCGAGTACAG AGACGGGGTG
GTGCTCTTCT CCCTGTCCGC CGTTAATAAG GGCCACGAGG TCTATGTATA CAAAGACGGG
GCGGTGAGGC AGCTGACCCG CTCGCCCCGC TTCGGGGCGC CGCTTGAGCA GATCCCGGAG
CCTCGCTCTG TGTGGTACCC CAGTTTCGAC GGGCGCAAGA TACAGGCCAA CATATACGCC
CCTCCTGGCG AGCCTAGGGG CGTGGTGGTG TACCTCCACG GAGGGCCCGA GAGCCAGGAC
CGGCCGGAGT TCAAGCCGCT AGTCGCCGCC ATGGTCTCCG CGGGTCTCCT CGTCGCGGCG
CCTAATTACC GTGGCAGCAC AGGCTTCGGC AAGTCCTTCG TCCACCTAGA CGACGTGGAG
AGGCGGTGGG ACGCCGTGAG GGACGTGGAG GTCTTCGCGA AGTGGTTGCA GGAGGAGGGA
ATTGCGAGGG GGAGGCCGTG CGTCGCCGGT GGCTCATACG GCGGCTACCT CACCCTCATG
GCCTTGGCCA CCGCGCCGGA TCTCTGGGCC TGCGGCGTGG AGATGGTGGG CATCTTCAAC
TTGGTGTCTT TCTTGGAGAG GACTGCGGCC TGGCGGAGGC GGTACAGGGA GGCGGAGTAC
GGCTCTCTCG ACAAGCAAAA AGACGTCCTC GTCCAGCTGA GCCCTGCCTC TCACGTGGAC
AAGATCAGGG CCCCCCTCAT GGTGGTCCAC GGCGCGAATG ACATCAGGGT GCCTGTTTAC
GAGGCTGAGC AACTGGTGCA GAGGCTGAGG GAGCTGGGGA GAGAGGCGAA GGCGCTTATC
CTGCCCGACG AGGGTCACGT AATTACAAAG GTGGAAAACC GGGTGAAGGT ATACACGGAG
GTGATTAAGT TTATTTTGCA ACATGTTTAA
 
Protein sequence
MELLVRRVLS VRSATAPRLG AGGLVFYLSD VTGVQQLWFF DGSRHDVYAP VEGRVGDYRV 
SKDGVVAVAV DRDGDEKWRL YLLGDDLMEV SAEGVNSLGA WSPDGSALAF TSTKDSPSDF
HLYVYRRGEG EVERLAELGG INVVEEWSEA GIFVTHYETN LDSTIYLFRD GELKELTKHS
GEALNFSPRY VGGGKALFLT NADWEYVGVA QMDLATGSWK YLVQLDRDVE RFDVWGNYLV
FSVNEEGRSG LYQMHIPSGL TYKLPAPAGV ATHLEYRDGV VLFSLSAVNK GHEVYVYKDG
AVRQLTRSPR FGAPLEQIPE PRSVWYPSFD GRKIQANIYA PPGEPRGVVV YLHGGPESQD
RPEFKPLVAA MVSAGLLVAA PNYRGSTGFG KSFVHLDDVE RRWDAVRDVE VFAKWLQEEG
IARGRPCVAG GSYGGYLTLM ALATAPDLWA CGVEMVGIFN LVSFLERTAA WRRRYREAEY
GSLDKQKDVL VQLSPASHVD KIRAPLMVVH GANDIRVPVY EAEQLVQRLR ELGREAKALI
LPDEGHVITK VENRVKVYTE VIKFILQHV
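
As a rough consistency check, the gene sequence can be translated and compared with the protein sequence listed here. The sketch below is a minimal translator, not the database's own pipeline: it builds a codon table from the standard amino-acid assignments, which translation table 11 shares (table 11 differs only in which codons may serve as starts). The function name `translate` and the two test fragments, taken from the first and last 30 bases of the gene sequence, are illustrative choices.

```python
from itertools import product

# Codon table built from the base order T, C, A, G and the amino-acid
# string of the standard genetic code. Translation table 11 uses the
# same assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored), stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last 30 bases of the gene sequence in this record.
first_30 = "ATGGAGCTGT TGGTGAGGAG GGTGCTTTCG"
last_30 = "GTGATTAAGT TTATTTTGCA ACATGTTTAA"

print(translate(first_30))  # MELLVRRVLS
print(translate(last_30))   # VIKFILQHV
```

Both fragments reproduce the corresponding ends of the listed protein sequence. Passing the full gene sequence to `translate` in the same way would allow a comparison against all 569 residues.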