Gene Information |
Locus tag | Msed_1127 |
Symbol | |
ID | 5103599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1056935 |
End bp | 1058860 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640507020 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_001191213 |
Protein GI | 146303897 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit |
TIGRFAM ID | |
|
|
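The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes a single terminal stop codon that is not counted in the protein length. Both are assumptions that happen to agree with the values in this record, not something the record states. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 1056935
end_bp = 1058860
gene_length_bp = 1926
protein_length_aa = 641


def span_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)
```

With the values from this record, both computed numbers agree with the listed Gene Length and Protein Length.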
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.814296 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTGCA GGGTTGCAAT TGACGTTGGT GGGACATTCA CAGACTTCAT AGCCCTAGTG AACGGGGAGA TAGTCACGGT AAAGACTCTA ACAAATCCAA CGAGGCCCTC TCAAGTCATT AAGGACGTCC TGTCCTCGCT TGGATGTGAG GTCACGGAGG TCGTTCACGC GACCACTCTC GCTACCAACG CCCTCCTGGG ACAGGAGAAA CTGGATCTCC CGAGGACGGC GCTCCTCACA ACTAGGAGTT TCAGGGACGT GATAGAGATA GGGAGGCAGA ATCGACCCCG TCTCTACGAC CTCAACTTCG AGAAGCCGAG GCAACTGGTA CCAAGGGAGT TGAGGGTTGA GGTTCAGGAG AGGGTTGACG CTCAGGGTAA CATACTAGAA AGAGTTGAGG AGTCCGAGAT CGAGAAAATT GCCCAGAACC TAAGGGAAAG AGGCGTTGAA TCCGTCGCGA TCAGTTACCT TCACTCTTAC CTTAACCCCA CTAACGAGAT CAGGACTGGG GAAGTCCTCT CTCGTCACTT CAGGTTTGTG TCCCTTTCCT CTGAGGTCGC CCCTGAACCG AGGGAGTATG AGAGAACCTC CACCACCGTG GTTAACGCTG TCCTCATGCC CCTGATCTCC TCGTATCTGC AGGAACTCAA CTTCCTTCCC TCCTTTCTGG TAATGTCAAG CTCAGGTGGC CTAGTTGACG TGGAGGAGGC CTCAAGGAAA CCTGTCCAGC TCGTGGAGTC GGGCCCAGCC GCAGGGGTGA TAGCCTCAGC CTCCCTCTTT CCAGGTAACG TGATCAGCTT TGACATGGGC GGGACCACGG CGAAGGCTGG GGTCGTCATC GACGGAAAGT TCGAGATCAC GACCGAGTAC GAGGTAGGTG GTGAGGTTCA TCACGGCAGG GTGGTGAAGG GTAGCGGTTA CCCCGTGAGG TTCCCCTTCG TGGACTTGGC AGAGGTCTCA GCTGGAGGAG GGACGGTGAT CTGGAGAGAC GACGCTGGGG CCTTAAGGGT TGGTCCCTTG AGTGCAGGGG CAGACCCAGG TCCCATGTGT TACGGAAGGG GAGGTGATAA GCCCACGGTG ACGGACGCAA ACCTTGTCCT GGGAAGGGTG GGGGAGGTGA TCGGTGGAGG GATGAGGCTA AAGCCCGAGT TGGCGAGGAA GGGGCTATCC AGGCTCGGTG ACCTGGAGGA CGTGAGTAGG GATGCCCTTG CCCTGGTAAA CCTGGAGATG GCCAGGGCCA TAAGGCTTGT CACGGTGGAG AGGGGGCTGG ATCCTTCAAG CTTCAGTCTC GTGGCCTTCG GTGGGGCTGG GCCACAGCAC GCGGTTTACC TGGCAGAGGA ACTGGGAATT TCCAAGGTGT TGATTCCACC TTACCCTGGG TTGTTTAGTG CCCTAGGCCT CCTCCTGGCT GACTGGCGCT TTGAGGCTAG GAAATCCTTT CCCAGGGACC TCGAGGCCGA GTTCGTGAAG CTGGAGAGGG AGCTTTACGA CAGGTTGAAG GGGAAGGTGG GTCACTTCCT CAGGTACGCT GACGTCAGGT ATCAGGGCCA GGGCTGGGAG CTCACAGTCC CCGTGAACGA CGTCAACGAG ATCAGGCAAG TCTTTGAGGA GAAGCACCTC TCAACCTACG GCTTCGTGAT GAGCGATAGG GAGATTGAGG TCGTGACCAT AAGGGTGTTC GCCGTGAGAA GGAGACCCCT CCCACAGCTC TCAGTTGTGT CGGGGCAGGG GGACAGCCCC GTCAAGAGGA GGAAGGCCCT CCTAGAGGAC GAGTGGGGTG AAGTGGACGT GTACGTTAGG 
GAGAAGTTGA GGAGGGGGGT TAGGGTGAGA GGTCCCGCAA TCATAGAGGA GTTCAGCTCC ACTACGGTGG TCAAGGACGG GTGGGAGGCC CTAGTAGACG AGTCCATAAC CTTGGTGAGA CCATGA
|
Protein sequence | MECRVAIDVG GTFTDFIALV NGEIVTVKTL TNPTRPSQVI KDVLSSLGCE VTEVVHATTL ATNALLGQEK LDLPRTALLT TRSFRDVIEI GRQNRPRLYD LNFEKPRQLV PRELRVEVQE RVDAQGNILE RVEESEIEKI AQNLRERGVE SVAISYLHSY LNPTNEIRTG EVLSRHFRFV SLSSEVAPEP REYERTSTTV VNAVLMPLIS SYLQELNFLP SFLVMSSSGG LVDVEEASRK PVQLVESGPA AGVIASASLF PGNVISFDMG GTTAKAGVVI DGKFEITTEY EVGGEVHHGR VVKGSGYPVR FPFVDLAEVS AGGGTVIWRD DAGALRVGPL SAGADPGPMC YGRGGDKPTV TDANLVLGRV GEVIGGGMRL KPELARKGLS RLGDLEDVSR DALALVNLEM ARAIRLVTVE RGLDPSSFSL VAFGGAGPQH AVYLAEELGI SKVLIPPYPG LFSALGLLLA DWRFEARKSF PRDLEAEFVK LERELYDRLK GKVGHFLRYA DVRYQGQGWE LTVPVNDVNE IRQVFEEKHL STYGFVMSDR EIEVVTIRVF AVRRRPLPQL SVVSGQGDSP VKRRKALLED EWGEVDVYVR EKLRRGVRVR GPAIIEEFSS TTVVKDGWEA LVDESITLVR P
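As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates short excerpts of the gene with a hand-built codon table. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are allowed, so the standard assignments are used here. The helper names are hypothetical, and only the first 30 and last 6 nucleotides are included, copied from the record above. The same functions could be applied to the full sequence.

```python
from itertools import product

# Codon-to-amino-acid assignments in the usual TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spacing between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_30 = "ATGGAGTGCA GGGTTGCAAT TGACGTTGGT"  # start of the gene sequence
last_6 = "CCATGA"                              # end of the gene sequence

print(translate(first_30))  # MECRVAIDVG, the first block of the protein sequence
print(translate(last_6))    # P*, the final residue followed by the stop codon
```

The excerpts reproduce the first ten residues and the last residue of the listed protein. `gc_fraction` is included so the GC content field can be recomputed from the full gene sequence in the same way.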