Gene Pars_0414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0414 
Symbol 
ID5054961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp362797 
End bp364338 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content59% 
IMG OID640467979 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_001152666 
Protein GI145590664 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTGGG AGGTTGTACA CAGGGCTACC GAGTATATCG CCGAAGAGGC CGGCATCGCG 
TTGAGGAATT CAGCCTTCTC GCCCAATATA AGAGAGCGTA TGGACCACAG CGTCGCGGTT
GTAGACGCCG AGGGCCGCAT TGTGGCTCAG GCAGAGCACA TCCCGGTCCA CCTAGGCTCC
TTCCACGTGG GTGTGCAGAA CCTTCTAGAA TATTTGAGGA GGGAAGGGGT GGAGCTGGAG
GATGGGGACG CTGTGCTCAC GAACGACCCC TACATATCGG GTACGCATCT AAACGACGTG
ATGGTGCTCT ACCCCGTGTT CTGGCACGGG AGGCTCGTCG CTTATATAGC GTCTAAGGCC
CACTACGTCG ACGTCGGGGG GCCGTTGCCG GCGTCGCTAA ACCCGAGGGC TAAGACTATA
TACGAGGAGG GGGTCGTGGT GCCTCCTGTG AAGATATTGA GGCGCGGCGC AATGAACAAG
GAGGCGTTGA GCTTTATCTT GGAGAACTTC AAGACGCCTG TAGTTGCAAG AGGCGATCTC
GAGGCCCAGC TGGCAGCGTC GCGTGTAGGG GCGGCAAGGG TGAAGGACTT ATTCGAGAGG
TTTGGCGACG TGTCGGAGCA GTGGAATGAG GCTATAGAAT ATGGCAGAAG GCTCGCGCTC
GCGGAGATCG CCACGTGGCC TCCCGGCAGA TATGAGGCTG AGGACTACCT AGATTGGGGC
GGCGAGCTCC TCCCCATAAG GCTAGCCCTG GAGATATCTG AGAAGGGCGT GAGGGCAGAC
TTTGAGGGCA CCGCGAGGCA GGTCGACGCG CCGCTAAACG CTGTCATAGG CGTCACCTTC
TCAGCCGTGT CATTTGCGGT GCGGTCAGCC ATAAGGGGCT ACATCCCCAC AAACTACGGC
TTCTACAGCC TCATAAAGCT AGACGCACCT GCAGGTTCCA TAGTAAACCC CCTCAAGCCC
GCTGCTGTGG GCGCCGGCAA CTTGGAGACG AGCCAGAGAG TCGCCGACGT CACGTTTCTG
GCGCTGTCCA AGGCGCTACC TGGGAGGATA CCCGCGGCGG GTTCCGGCAC TATGATGAAC
GTCATGATGG GCGGCTTCTG GCAAGGGCGG TACTGGTCAT ACTACGAGAC AATAGGCGGA
GGGACTGGGG GCAGGCCCAA CGGCCCCGGC GTGTCGGGCG TGCACGTCAA CATGACAAAC
ACGCTGAACA CGCCGATTGA AATTGCGGAG AGGGAGTACC CCATAAGATT CACTGCATAC
CGAATAAGGG AGGGAAGCGG AGGACGCGGG AGGTATCCAG GCGGAGATGG TATAGTTAGG
GCGTTCAAGG CACTGGCACC CACTACGCTG TCCATAATTG CAAGCAGGCT CGCAGTAGGG
CCTTGGGGCC TAGAAGGCGG CGAGCCTGGC AAGCCAGGGA AAATAACTAT AAAGAGAAGC
AGTGGCAGAG TCGAGTCCAT CGGTAGCGAG ACAGTGACGC TAGCAGAGGG AGACGAGGTG
GTCATAGAGA CGCCGGGTGG GGGCGGGTAC GGCAAGCCGT AA
 
Protein sequence
MRWEVVHRAT EYIAEEAGIA LRNSAFSPNI RERMDHSVAV VDAEGRIVAQ AEHIPVHLGS 
FHVGVQNLLE YLRREGVELE DGDAVLTNDP YISGTHLNDV MVLYPVFWHG RLVAYIASKA
HYVDVGGPLP ASLNPRAKTI YEEGVVVPPV KILRRGAMNK EALSFILENF KTPVVARGDL
EAQLAASRVG AARVKDLFER FGDVSEQWNE AIEYGRRLAL AEIATWPPGR YEAEDYLDWG
GELLPIRLAL EISEKGVRAD FEGTARQVDA PLNAVIGVTF SAVSFAVRSA IRGYIPTNYG
FYSLIKLDAP AGSIVNPLKP AAVGAGNLET SQRVADVTFL ALSKALPGRI PAAGSGTMMN
VMMGGFWQGR YWSYYETIGG GTGGRPNGPG VSGVHVNMTN TLNTPIEIAE REYPIRFTAY
RIREGSGGRG RYPGGDGIVR AFKALAPTTL SIIASRLAVG PWGLEGGEPG KPGKITIKRS
SGRVESIGSE TVTLAEGDEV VIETPGGGGY GKP