Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0413 |
Symbol | |
ID | 5054589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 360854 |
End bp | 362800 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640467978 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_001152665 |
Protein GI | 145590663 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.96856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCTTG TGGGAGTTGA CGTGGGAGGG ACTTTTACCG ACTTTGTATT CCTAGACGAG GGGGGCGAGA TCAAGACGCT CAAGATATTG TCGACTCCTA GGGAGCCCGA AAAGGCGGTG ATTGAGGGGC TCTCGGCGGT TAAGTTCTCG GAGGTTCTCC ACGCGTCGAC TATAGGAACA AACGCGTTGT TGGGACAAAT GGGGCTCGAG GTGCCAAGGG TAGCATTCTT CACCACGAGG GGGTTCCGCG ATGTAGTTGA GATCGGGAGA CAGAACCGGC CTAGGCTCTA CGACTTATTT TTCCAAAAGC CGAGGCCCCT AGCCCCAAGA GAGCTGAGGT TTGAGGTAGA CGAGAGGACT CTGCCAGATG GGAGAGTGGA AAAGGCCGTG GATTTAGGAG AAGTGGCAGA GCTCGCTAGG AAGGCCAAGG CCGCGGGTGC CATGAGCGTG GCTGTGGGTT TTCTCCACTC CTACGCAAAT CCCTCAAACG AGGAGGTAGC GGCGAAGTTG CTGAGGGAGT ACTTCGAGTA CGTGACGGCG TCCTACGAGG TGGCTTGGGA GCCCAGGGAG TACGAGAGGT TTTCCACTGC GCTGGTAAAC GCCGCGTTGA TGCCGCTTGT GGGGAGGTAT CTGGCCAAGC TACAGAGCTA TGTGGAGTCG CGGGGAGGGA AGATGTACGT CATGGCGAGT TCCGGCGGTC TGGTGACAGT AGAGGAGGCG GCGAAGAGGC CTGTACAGCT TGTCGAGTCT GGGCCTGCCG CGGGCGTAAT AGCCGCGGCC GAGCTCGCCA AGCTGTTGGG CGAGGGCCGC GTAATATCTT TCGATATGGG CGGCACCACG GCCAAGGCGG GGACTGTTGT CGATTTTCAG CCTTCCATAA CAACGGAGTA CGAGGTGGGG GGCGAGAGCC ACCGGGGCAG AGTTATTAAG GGGTCGGGAT ACCCGGTGCG TTTTCCCTTT GTTGACCTTG CCGAGGTTTC GGCCGGGGGC GGGACGATAA TATGGAGGGA CGCAGGCGGT GCGCTAAGGG TCGGCCCGTT GAGCGCCGGC GCCGATCCCG GCCCCGTCTG CTACGGCAGA GGCGGCGTCG ATCCCACGGT TACAGATGCC AACTTGGCGC TTGGGAGAAT TCCGGAGGCG CTGGCGGGCG GCCGCATGAG GCTTGACGCC GAGGCGGCGA AGAGGGCGCT CGCCAAGCTC GGCGACCCCG TAGATGTGGC AAGTTCGGCA CTAAGGCTCA TCAACTTAGA GATGGCTAGG GCTATTAGGC TAGTCACTGT GGAAAGGGGC CTTGACCCCT CTTCCTTTGT CTTGATGGCT TTTGGTGGGG CTGGTCCGCA ACACGCCACT GAGGTGGCGG AGGAAATGGG GATAAACCGC GTGCTCATAC CGCCTATGCC TGGAGTCTTC ACATCGCTGG GGATGCTTAT GGCGGACTTC AAGTTCGAGG CCCGTATGGC TTATCCTAAG GACATAGCAA AGGGCTTTGC TGAGCTTGAG GAAAAGTTGT CCCAGCATCG GCCTGACTAC TTCTTGAGAT ACGCCGACGT TAGGTATAAG GGGCAGGGGT GGGAATTAAC TGTGCCTTTA GGCGCCGACG CATCCTATGA TGCAGTTAAG AGGGCGTTTG AGGAGAAGCA TACGGCGACC TACGGCTTTA AGCTGGACAG AGATATAGAG GTTGTCACAA TCCGCGTCTT TGCCGTGGTG AGGCGGGCTA AGCCGCGGCT ACCCGAGCCA CCTACAAAAG GCAACCCCAG CGTCGCCGAG AAGGAGGTTT ACTTCGATGG GTGGGTAAAG GCCGCTGTGT ATAATAGGGC AGAGTTGCCG CTGGGCTACA AGATCAAGGG GCCCGCTCTG ATTGTTGAGG ACTACTCGAC TACAGTAATC CCGCCACGTT GGGAGGCGAT GGTGGGCAAG TACGGCGTGC TGGAGCTGAG GCTATGA
|
Protein sequence | MGLVGVDVGG TFTDFVFLDE GGEIKTLKIL STPREPEKAV IEGLSAVKFS EVLHASTIGT NALLGQMGLE VPRVAFFTTR GFRDVVEIGR QNRPRLYDLF FQKPRPLAPR ELRFEVDERT LPDGRVEKAV DLGEVAELAR KAKAAGAMSV AVGFLHSYAN PSNEEVAAKL LREYFEYVTA SYEVAWEPRE YERFSTALVN AALMPLVGRY LAKLQSYVES RGGKMYVMAS SGGLVTVEEA AKRPVQLVES GPAAGVIAAA ELAKLLGEGR VISFDMGGTT AKAGTVVDFQ PSITTEYEVG GESHRGRVIK GSGYPVRFPF VDLAEVSAGG GTIIWRDAGG ALRVGPLSAG ADPGPVCYGR GGVDPTVTDA NLALGRIPEA LAGGRMRLDA EAAKRALAKL GDPVDVASSA LRLINLEMAR AIRLVTVERG LDPSSFVLMA FGGAGPQHAT EVAEEMGINR VLIPPMPGVF TSLGMLMADF KFEARMAYPK DIAKGFAELE EKLSQHRPDY FLRYADVRYK GQGWELTVPL GADASYDAVK RAFEEKHTAT YGFKLDRDIE VVTIRVFAVV RRAKPRLPEP PTKGNPSVAE KEVYFDGWVK AAVYNRAELP LGYKIKGPAL IVEDYSTTVI PPRWEAMVGK YGVLELRL
|
| |