Gene Pars_0413 details


Gene Information

Locus tag: Pars_0413
Symbol:
ID: 5054589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 360854
End bp: 362800
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 59%
IMG OID: 640467978
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_001152665
Protein GI: 145590663
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
TIGRFAM ID:
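
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(360854, 362800)
print(length, protein_length_aa(length))  # prints: 1947 648
```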


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.96856
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCTTG TGGGAGTTGA CGTGGGAGGG ACTTTTACCG ACTTTGTATT CCTAGACGAG 
GGGGGCGAGA TCAAGACGCT CAAGATATTG TCGACTCCTA GGGAGCCCGA AAAGGCGGTG
ATTGAGGGGC TCTCGGCGGT TAAGTTCTCG GAGGTTCTCC ACGCGTCGAC TATAGGAACA
AACGCGTTGT TGGGACAAAT GGGGCTCGAG GTGCCAAGGG TAGCATTCTT CACCACGAGG
GGGTTCCGCG ATGTAGTTGA GATCGGGAGA CAGAACCGGC CTAGGCTCTA CGACTTATTT
TTCCAAAAGC CGAGGCCCCT AGCCCCAAGA GAGCTGAGGT TTGAGGTAGA CGAGAGGACT
CTGCCAGATG GGAGAGTGGA AAAGGCCGTG GATTTAGGAG AAGTGGCAGA GCTCGCTAGG
AAGGCCAAGG CCGCGGGTGC CATGAGCGTG GCTGTGGGTT TTCTCCACTC CTACGCAAAT
CCCTCAAACG AGGAGGTAGC GGCGAAGTTG CTGAGGGAGT ACTTCGAGTA CGTGACGGCG
TCCTACGAGG TGGCTTGGGA GCCCAGGGAG TACGAGAGGT TTTCCACTGC GCTGGTAAAC
GCCGCGTTGA TGCCGCTTGT GGGGAGGTAT CTGGCCAAGC TACAGAGCTA TGTGGAGTCG
CGGGGAGGGA AGATGTACGT CATGGCGAGT TCCGGCGGTC TGGTGACAGT AGAGGAGGCG
GCGAAGAGGC CTGTACAGCT TGTCGAGTCT GGGCCTGCCG CGGGCGTAAT AGCCGCGGCC
GAGCTCGCCA AGCTGTTGGG CGAGGGCCGC GTAATATCTT TCGATATGGG CGGCACCACG
GCCAAGGCGG GGACTGTTGT CGATTTTCAG CCTTCCATAA CAACGGAGTA CGAGGTGGGG
GGCGAGAGCC ACCGGGGCAG AGTTATTAAG GGGTCGGGAT ACCCGGTGCG TTTTCCCTTT
GTTGACCTTG CCGAGGTTTC GGCCGGGGGC GGGACGATAA TATGGAGGGA CGCAGGCGGT
GCGCTAAGGG TCGGCCCGTT GAGCGCCGGC GCCGATCCCG GCCCCGTCTG CTACGGCAGA
GGCGGCGTCG ATCCCACGGT TACAGATGCC AACTTGGCGC TTGGGAGAAT TCCGGAGGCG
CTGGCGGGCG GCCGCATGAG GCTTGACGCC GAGGCGGCGA AGAGGGCGCT CGCCAAGCTC
GGCGACCCCG TAGATGTGGC AAGTTCGGCA CTAAGGCTCA TCAACTTAGA GATGGCTAGG
GCTATTAGGC TAGTCACTGT GGAAAGGGGC CTTGACCCCT CTTCCTTTGT CTTGATGGCT
TTTGGTGGGG CTGGTCCGCA ACACGCCACT GAGGTGGCGG AGGAAATGGG GATAAACCGC
GTGCTCATAC CGCCTATGCC TGGAGTCTTC ACATCGCTGG GGATGCTTAT GGCGGACTTC
AAGTTCGAGG CCCGTATGGC TTATCCTAAG GACATAGCAA AGGGCTTTGC TGAGCTTGAG
GAAAAGTTGT CCCAGCATCG GCCTGACTAC TTCTTGAGAT ACGCCGACGT TAGGTATAAG
GGGCAGGGGT GGGAATTAAC TGTGCCTTTA GGCGCCGACG CATCCTATGA TGCAGTTAAG
AGGGCGTTTG AGGAGAAGCA TACGGCGACC TACGGCTTTA AGCTGGACAG AGATATAGAG
GTTGTCACAA TCCGCGTCTT TGCCGTGGTG AGGCGGGCTA AGCCGCGGCT ACCCGAGCCA
CCTACAAAAG GCAACCCCAG CGTCGCCGAG AAGGAGGTTT ACTTCGATGG GTGGGTAAAG
GCCGCTGTGT ATAATAGGGC AGAGTTGCCG CTGGGCTACA AGATCAAGGG GCCCGCTCTG
ATTGTTGAGG ACTACTCGAC TACAGTAATC CCGCCACGTT GGGAGGCGAT GGTGGGCAAG
TACGGCGTGC TGGAGCTGAG GCTATGA
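
The gene sequence above is laid out in space-separated blocks of 10 bases, 60 per row. To work with it, the whitespace has to be removed first; the GC content reported in the Gene Information section can then be recomputed. The sketch below is one way to do this in plain Python. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the result."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGGGCCTTG TGGGAGTTGA CGTGGGAGGG ACTTTTACCG ACTTTGTATT CCTAGACGAG"
dna = clean_sequence(first_row)
print(len(dna))  # prints: 60
print(f"GC content of this row: {gc_percent(dna):.1f}%")
```

Pasting the full listing into `clean_sequence` should yield a string of 1947 bases, matching the Gene Length field.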
 
Protein sequence
MGLVGVDVGG TFTDFVFLDE GGEIKTLKIL STPREPEKAV IEGLSAVKFS EVLHASTIGT 
NALLGQMGLE VPRVAFFTTR GFRDVVEIGR QNRPRLYDLF FQKPRPLAPR ELRFEVDERT
LPDGRVEKAV DLGEVAELAR KAKAAGAMSV AVGFLHSYAN PSNEEVAAKL LREYFEYVTA
SYEVAWEPRE YERFSTALVN AALMPLVGRY LAKLQSYVES RGGKMYVMAS SGGLVTVEEA
AKRPVQLVES GPAAGVIAAA ELAKLLGEGR VISFDMGGTT AKAGTVVDFQ PSITTEYEVG
GESHRGRVIK GSGYPVRFPF VDLAEVSAGG GTIIWRDAGG ALRVGPLSAG ADPGPVCYGR
GGVDPTVTDA NLALGRIPEA LAGGRMRLDA EAAKRALAKL GDPVDVASSA LRLINLEMAR
AIRLVTVERG LDPSSFVLMA FGGAGPQHAT EVAEEMGINR VLIPPMPGVF TSLGMLMADF
KFEARMAYPK DIAKGFAELE EKLSQHRPDY FLRYADVRYK GQGWELTVPL GADASYDAVK
RAFEEKHTAT YGFKLDRDIE VVTIRVFAVV RRAKPRLPEP PTKGNPSVAE KEVYFDGWVK
AAVYNRAELP LGYKIKGPAL IVEDYSTTVI PPRWEAMVGK YGVLELRL
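
As a consistency check, the protein sequence can be reproduced by translating the gene sequence codon by codon. The sketch below is a minimal example, assuming that translation table 11 assigns the same amino acids to codons as the standard genetic code (to my knowledge the two tables differ only in which codons may act as start codons). The `translate` helper is hypothetical; it does not handle ambiguous bases or alternative start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence listed above.
print(translate("ATGGGCCTTG TGGGAGTTGA CGTGGGAGGG"))  # prints: MGLVGVDVGG
print(translate("TACGGCGTGC TGGAGCTGAG GCTATGA"))     # prints: YGVLELRL
```

Both fragments agree with the start and end of the protein sequence listed above.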