Gene Information |
Locus tag | Msed_1449 |
Symbol | |
ID | 5104819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1416100 |
End bp | 1417185 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507337 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001191530 |
Protein GI | 146304214 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
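
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Msed_1449.
length = gene_length_bp(1416100, 1417185)
print(length)                     # 1086
print(protein_length_aa(length))  # 361
```

Both results agree with the Gene Length (1086 bp) and Protein Length (361 aa) rows of the record.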
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTACT CTCCACGGAA CTCGTTTAAG TTCTGTATCC TTGGGGGAGG ACAACTTGGC TGGATGATGG TTCTTGAGGG ACTGAAGTTT CCAATCTCCT TCCACGTATA CGGAGAGAAA GAGGATCCAG CCTGCAAGAT TGCCAACTGC TTCAAGGAAG AGTACAGGAA GGTTATTGAG GAGTGCGACG TCGTCACATA CGAGTTTGAG CACGTGGATG ATAAGCCACT TGAACTAGCT AGGGACCTTA ACAAGCTAAT GCCTGGAATG AATGCGGTCG AGCTCAAGAG GGTGAGACAT CTAGAGAAGG AATTTCTTAG AAGAGAAGGA TTACCGGTAC CGCGTTTCGT CACTGTGAGG GGTGGAGATG AGGCACTTAG GGTTCTCAAG AACGAGTTCA ATGGCACGGG AGTTATAAAG AGATCCAAGG GTGGTTACGA CGGAAAGGGG CAGTTCTTCG TGAGGGGAGA CCCTGAAAAA TACTCTTTCC TTAGGGATGA GAACGATTAC TTCGTTGTTG AGGAACTGGT CAACTTCGAC TATGAGGCCT CAATAATAGC TGTGAGGAGG GGGAACGAGT TCAGGGCCTA TCCTCCGACG TTCAATTACA ACGAGAAGGG AATCCTCGTC TATAATTACG GGCCCTTCGG TAACGAGGAG ATGGTGAAGA TTGCCGAGGA ACTCACGAGA AAGTTAAACT ACACTGGCGT AATAGGAATA GAGTTCTTCG TTAAGGATGG AAAGGTTCTC ATCAACGAGT TTGCTCCCAG GGTTCATAAC ACGGGCCATT ACACCTTGGA CGGAGCTGAG GTATCCCAGT TTGAACAACA CGTTAGGGCC CTGGCTGGAT TGGAACTAGG GAGTACCAAA GTGCTTACCT TCTCGGGGAT GATAAACATA CTGGGCATAG CCTCTCCTCC CATGGAGATC CTAAAGCTCG GAACCCTATA CTGGTATGGA AAAAGCGAGG CTAGGAAGAG GAGGAAGATG GGGCACGTGA ACGTTCTAGG AGATGATCTG GCTGAAGTTA AGGAAAAGAT TGAAAATGTT ATGAATATAT TATATCCCAA TGGGCCTGAT CTATGA
|
Protein sequence | MTYSPRNSFK FCILGGGQLG WMMVLEGLKF PISFHVYGEK EDPACKIANC FKEEYRKVIE ECDVVTYEFE HVDDKPLELA RDLNKLMPGM NAVELKRVRH LEKEFLRREG LPVPRFVTVR GGDEALRVLK NEFNGTGVIK RSKGGYDGKG QFFVRGDPEK YSFLRDENDY FVVEELVNFD YEASIIAVRR GNEFRAYPPT FNYNEKGILV YNYGPFGNEE MVKIAEELTR KLNYTGVIGI EFFVKDGKVL INEFAPRVHN TGHYTLDGAE VSQFEQHVRA LAGLELGSTK VLTFSGMINI LGIASPPMEI LKLGTLYWYG KSEARKRRKM GHVNVLGDDL AEVKEKIENV MNILYPNGPD L
|
| |
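
The sequences above are printed in blocks of ten characters separated by spaces. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, translated, and checked for GC content. It assumes the amino-acid assignments of NCBI translation table 11 (the table listed in the record), which are identical to the standard code; table 11 differs mainly in its alternative start codons, which this sketch does not handle. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGACGTACT CTCCACGGAA CTCGTTTAAG"
print(translate(prefix))  # MTYSPRNSFK
print(gc_fraction(prefix))
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length rows to be checked in the same way.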