Gene Pars_0732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0732 
Symbol 
ID5054678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp648284 
End bp650164 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content61% 
IMG OID640468289 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_001152970 
Protein GI145590968 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.517111 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATAG GGCCTGAGGA TTTGGGGAAG CTCGTCCTAG TCTCGGGGCC GGAGGCGGGC 
GGCGGGAGGA TCCTCTTCAC AGTCACGAGG ATCAGCTTGG AGAACGACCG CTATGAGTCC
AGCGTATGGG CGTGCGATGG GGGTTGCAGG GCGGTGCTGC CTGGCCCGTT CGACGTCTCG
GCTAAGCCGT CGCCAGACGG CTCCAGGATC GCCCTCCTTT CCAGGAGGGG CTTCGGGGAG
AAGGAGAAGG GAGTTGGCCT ATGGGTCGCG GAGTGGGGCG GGGAGCCGCG GTTGCTTGCT
AAGTTCCTCG GCGTGTTGGA CTACGCGTGG GCCCCATCGG GAGAGGCCCT CGCAGTGGTG
GCCTACGAGG GGTCTCCAGA GGCTGACGTG AAGCACGCGG AGAGGCTCCC CCTGTGGATA
AACGACTTCG GCTTTGCCTA CAACGTGTCT AGCCATCTCT ACCTAGTCGA CGCCTACAGC
GGCGTCGTTG AGAAGCTGAC GGAAGGCGAT GTTCGGGTGC TAAAAGCGGC TTTCAGCCCA
GACGGGAGGA GGGTGGCGTA CGCAGTGGCC AGAGACTGGC TTAGGCCGTA TCTCCAAGAC
GTCGTTGTTC TGGATTTAAA AAGTGGGGAG CGGGCTACCT GGGCGTCTGG CTACACGTCT
GTTACTGAGA TCGCTTGGCA CCCCAACGGC CGGGGCTTGG CCTTCACGGG CCACCTAAGG
CCGAGGGGCT TTTCCTCCCA CTCCAGGGTG TGGGTAGTGG AGGAGGGGGG AGAGCCCCGT
TGCCTAACCT GTAGCTTTAA ATACGGCGTG GGCAACGGCG TCAACAGCGA CGCGAGGGGG
CCTAGCTATA CAAGATCCCT GTACTGGGAC GGCGGCTCAG TCCTCTTCCA AGCCACCGTA
GGGGGCTCCG TCGGCGTGTA CCGCGCCTCT CTGAGCGGCG AGGTGGAGGC CGTGTTGGCC
CCAAGAGGCG TCGTTGATGA GTTCGTCCCA GTAGGTAGGG ACATATTGTA CACATACATG
GAGGCTGACA AGCCAAAGGA GCTTTACCTG TGGGACGGGT CGGAGGCGAA GAAGCTTACG
CGTTTCAACG ACTTCGTGCT GGAGAGGTGG CGCTTGAAGA GGCCCCAGAG GTTTGTAATG
AAGGCAAGCG ACGGGGCGGA GGTGGAGGGC TGGGTCCTCC TGCCGGAGGG CGCTGGACCG
CATAAGTGGG TTCTCTACAT ACACGGCGGG CCGAAGACGG CGTACGGCGA GGGGTTCATG
TTTGAGTTCC ACCTCCTAGC CTCGCGGGGC TACGCCGTGG TGTTCTCGAA CCCCAGGGGG
AGCGACGGCT ACGACGAGGA GTTCGCCGAC ATTAGGTGCA GATACGGCGA GAGGGATTTC
CAAGACTTAA TGGAGGTGGC GGACTACGCG GTGAGGAACT TCCCGCTTGA CCCCCAGAAG
GCGGCGGTGG CGGGGGGCTC ATACGGCGGG TTTATGACAA ACTGGATAAT TACGCGGGTG
GACAAGTTCA AGGCGGCTGT GACCCAGCGC TCTATCTGCG ACTGGGTCTC CATGTACGGC
ACGACTGACA TCGGGTGGTA CTTCGTGGAG GACCAGCTGT GTTGCACGCC GTGGAGGGAC
AGAGAGCGTT GCATCGAGAA GAGCCCGCTG TACTACGCTG ACAGGGTGAA GACTCCCACG
CTCATCATAC ACTCCATGGA GGACTACCGG ACGTGGCTCG ACCAAGGCGT GCTCTTCTTC
ACCGCGCTGA GGTTACACGG CGTGGAGGCG AGGCTGGCCA TTTTCCCCGA GGAGAGCCAT
GAGCTCACCA GAAAGGGGAA GCCCAGGCAC CGGGTGGAAA ACTTTAAGGA GATCTTGAGC
TGGCTTGATA AACATGTATA A
 
Protein sequence
MAIGPEDLGK LVLVSGPEAG GGRILFTVTR ISLENDRYES SVWACDGGCR AVLPGPFDVS 
AKPSPDGSRI ALLSRRGFGE KEKGVGLWVA EWGGEPRLLA KFLGVLDYAW APSGEALAVV
AYEGSPEADV KHAERLPLWI NDFGFAYNVS SHLYLVDAYS GVVEKLTEGD VRVLKAAFSP
DGRRVAYAVA RDWLRPYLQD VVVLDLKSGE RATWASGYTS VTEIAWHPNG RGLAFTGHLR
PRGFSSHSRV WVVEEGGEPR CLTCSFKYGV GNGVNSDARG PSYTRSLYWD GGSVLFQATV
GGSVGVYRAS LSGEVEAVLA PRGVVDEFVP VGRDILYTYM EADKPKELYL WDGSEAKKLT
RFNDFVLERW RLKRPQRFVM KASDGAEVEG WVLLPEGAGP HKWVLYIHGG PKTAYGEGFM
FEFHLLASRG YAVVFSNPRG SDGYDEEFAD IRCRYGERDF QDLMEVADYA VRNFPLDPQK
AAVAGGSYGG FMTNWIITRV DKFKAAVTQR SICDWVSMYG TTDIGWYFVE DQLCCTPWRD
RERCIEKSPL YYADRVKTPT LIIHSMEDYR TWLDQGVLFF TALRLHGVEA RLAIFPEESH
ELTRKGKPRH RVENFKEILS WLDKHV