Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0414 |
Symbol | |
ID | 5054961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 362797 |
End bp | 364338 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640467979 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_001152666 |
Protein GI | 145590664 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTGGG AGGTTGTACA CAGGGCTACC GAGTATATCG CCGAAGAGGC CGGCATCGCG TTGAGGAATT CAGCCTTCTC GCCCAATATA AGAGAGCGTA TGGACCACAG CGTCGCGGTT GTAGACGCCG AGGGCCGCAT TGTGGCTCAG GCAGAGCACA TCCCGGTCCA CCTAGGCTCC TTCCACGTGG GTGTGCAGAA CCTTCTAGAA TATTTGAGGA GGGAAGGGGT GGAGCTGGAG GATGGGGACG CTGTGCTCAC GAACGACCCC TACATATCGG GTACGCATCT AAACGACGTG ATGGTGCTCT ACCCCGTGTT CTGGCACGGG AGGCTCGTCG CTTATATAGC GTCTAAGGCC CACTACGTCG ACGTCGGGGG GCCGTTGCCG GCGTCGCTAA ACCCGAGGGC TAAGACTATA TACGAGGAGG GGGTCGTGGT GCCTCCTGTG AAGATATTGA GGCGCGGCGC AATGAACAAG GAGGCGTTGA GCTTTATCTT GGAGAACTTC AAGACGCCTG TAGTTGCAAG AGGCGATCTC GAGGCCCAGC TGGCAGCGTC GCGTGTAGGG GCGGCAAGGG TGAAGGACTT ATTCGAGAGG TTTGGCGACG TGTCGGAGCA GTGGAATGAG GCTATAGAAT ATGGCAGAAG GCTCGCGCTC GCGGAGATCG CCACGTGGCC TCCCGGCAGA TATGAGGCTG AGGACTACCT AGATTGGGGC GGCGAGCTCC TCCCCATAAG GCTAGCCCTG GAGATATCTG AGAAGGGCGT GAGGGCAGAC TTTGAGGGCA CCGCGAGGCA GGTCGACGCG CCGCTAAACG CTGTCATAGG CGTCACCTTC TCAGCCGTGT CATTTGCGGT GCGGTCAGCC ATAAGGGGCT ACATCCCCAC AAACTACGGC TTCTACAGCC TCATAAAGCT AGACGCACCT GCAGGTTCCA TAGTAAACCC CCTCAAGCCC GCTGCTGTGG GCGCCGGCAA CTTGGAGACG AGCCAGAGAG TCGCCGACGT CACGTTTCTG GCGCTGTCCA AGGCGCTACC TGGGAGGATA CCCGCGGCGG GTTCCGGCAC TATGATGAAC GTCATGATGG GCGGCTTCTG GCAAGGGCGG TACTGGTCAT ACTACGAGAC AATAGGCGGA GGGACTGGGG GCAGGCCCAA CGGCCCCGGC GTGTCGGGCG TGCACGTCAA CATGACAAAC ACGCTGAACA CGCCGATTGA AATTGCGGAG AGGGAGTACC CCATAAGATT CACTGCATAC CGAATAAGGG AGGGAAGCGG AGGACGCGGG AGGTATCCAG GCGGAGATGG TATAGTTAGG GCGTTCAAGG CACTGGCACC CACTACGCTG TCCATAATTG CAAGCAGGCT CGCAGTAGGG CCTTGGGGCC TAGAAGGCGG CGAGCCTGGC AAGCCAGGGA AAATAACTAT AAAGAGAAGC AGTGGCAGAG TCGAGTCCAT CGGTAGCGAG ACAGTGACGC TAGCAGAGGG AGACGAGGTG GTCATAGAGA CGCCGGGTGG GGGCGGGTAC GGCAAGCCGT AA
|
Protein sequence | MRWEVVHRAT EYIAEEAGIA LRNSAFSPNI RERMDHSVAV VDAEGRIVAQ AEHIPVHLGS FHVGVQNLLE YLRREGVELE DGDAVLTNDP YISGTHLNDV MVLYPVFWHG RLVAYIASKA HYVDVGGPLP ASLNPRAKTI YEEGVVVPPV KILRRGAMNK EALSFILENF KTPVVARGDL EAQLAASRVG AARVKDLFER FGDVSEQWNE AIEYGRRLAL AEIATWPPGR YEAEDYLDWG GELLPIRLAL EISEKGVRAD FEGTARQVDA PLNAVIGVTF SAVSFAVRSA IRGYIPTNYG FYSLIKLDAP AGSIVNPLKP AAVGAGNLET SQRVADVTFL ALSKALPGRI PAAGSGTMMN VMMGGFWQGR YWSYYETIGG GTGGRPNGPG VSGVHVNMTN TLNTPIEIAE REYPIRFTAY RIREGSGGRG RYPGGDGIVR AFKALAPTTL SIIASRLAVG PWGLEGGEPG KPGKITIKRS SGRVESIGSE TVTLAEGDEV VIETPGGGGY GKP
|
| |