Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1128 |
Symbol | |
ID | 5103600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1058857 |
End bp | 1060389 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640507021 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_001191214 |
Protein GI | 146303898 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCCT GGGAAATAGT GAGCAGGGCC CTCGAGTTCG TCGCCGAGGA GATGGGGGTA ATGCTCAAGA GGTCTGCGAT GTCGCCCAAC ATAAGGGAGA GGATGGATCA CAGCTGTGCC ATCGTCAACG AGAGGGGTGA GGTCGTGGCC CAGGCGGAGC ACATCCCCGT TCACCTGGGA TCGTTCAGCG TTGGGGTCAA GAATCTCCTG CAACAGGTTG AGCTCGAGGA AGGGGATATG GCGATCGTGA ATGATCCCTA CGTGTCGGGG ACACACCTCA ACGACGTCAT GGTAATGGCA CCGTTACCTG GGGGACTGGG ATATGTGGTG AACAAAGCAC ATCACGTTGA CGTTGGAGGG CCGATGCCGG GAAGCCTCAA CCCCTCAGCC TCCACGCTGT ATGAGGAGGG TTTCGTCATA CCCCCTCTCA AGTTGATGAG GAAGGGGAAA CTCAACGAGG ATGTGGTGAA GATGATCAGG GAGAACTTTA AGGTCCCTGA CACCTCACTT GGGGACCTGA ACGCCCAGAT CTCGGCTAAC CTTGTGGGGA TAGCCAGGGT AAAGCAACTA GTGGAGAGGT ACGGCGTCTC GCAGGTGGTT GAAGCATGGA ACACCTCAAT GGAGTACGCA AGGAGGCTCA CGTTGACAGA GCTGGCGAGA TGGCCCAAGG GGTCTGGGGA GGACGAGGAC TATCTGGAAC TTGATAGGCT GACCAGGATT AGGGCGAGGG TGGAGATTAG CGACCAGGGA GTCTTGGCCG ACTTCACGGG CACGGATCCC CAGGTTGAGT CCCCCTTTAA CGCCGTGTAT GGGGTAACGT TCTCCGCGGT TAGCTTCGTG ATAAGGTCTC TCCTTGGGAA GGACGTCCCA ACAAACGAGG GCTTCTATAG CGTGGTCAGG GTTAAGGCAC CCCTGGGGAG CCTCGTGAAC CCCACGAAGC CGTCAGCTGT GGGTGGAGGA AACGTGGAAA CCTCACAGAG GATAGCTGAC GTCACGTTCA AGGCCCTGTC CCACCTGATG CCCGTCCCGG CAGCCGGGTC CGGGACAATG ATGAACATCA TGATGGGAGG GTTGAGAGGG AGGTACTGGG CCTACTACGA GACTGTGGGA GGAGGCATGG GGGCCAGGCC TAACCGCGAC GGAGTCTCAG CGGTCCAGGT CAACATGACC AACACCTTGA ACACGCCTAT AGAGATAGCG GAAAGGCAGT ACCCCCTTCT CTTCACGGCA TACAGGGTGA GGGAGGGAAG CGGAGGAAGA GGGAGGTTCA AGGGAGGTGA CGGGATAGTT AGGGCCTTCA AGGTCATGGA CAGGACGAGG TTGTCAGTGA TGGCTGAGAG GTTCCTTATC CCACCTTGGG GGCTATTAGG TGGTGGAAAC GGGAAACCAG GAAGGGTGAC GATCGTGAGA GGTGGAGTCA GAAGGGAGAT GCCCAGTAAG TTCTCAACTG TTCTGGAGTC AGGGGACGAG GTCGTGATAG AGACGCCAGG AGGCGGGGGA CTAGGAAAAG AGGAGGAAAC CTCGGAAGGC TAA
|
Protein sequence | MTSWEIVSRA LEFVAEEMGV MLKRSAMSPN IRERMDHSCA IVNERGEVVA QAEHIPVHLG SFSVGVKNLL QQVELEEGDM AIVNDPYVSG THLNDVMVMA PLPGGLGYVV NKAHHVDVGG PMPGSLNPSA STLYEEGFVI PPLKLMRKGK LNEDVVKMIR ENFKVPDTSL GDLNAQISAN LVGIARVKQL VERYGVSQVV EAWNTSMEYA RRLTLTELAR WPKGSGEDED YLELDRLTRI RARVEISDQG VLADFTGTDP QVESPFNAVY GVTFSAVSFV IRSLLGKDVP TNEGFYSVVR VKAPLGSLVN PTKPSAVGGG NVETSQRIAD VTFKALSHLM PVPAAGSGTM MNIMMGGLRG RYWAYYETVG GGMGARPNRD GVSAVQVNMT NTLNTPIEIA ERQYPLLFTA YRVREGSGGR GRFKGGDGIV RAFKVMDRTR LSVMAERFLI PPWGLLGGGN GKPGRVTIVR GGVRREMPSK FSTVLESGDE VVIETPGGGG LGKEEETSEG
|
| |