Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1877 |
Symbol | |
ID | 5104145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1820118 |
End bp | 1821152 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507763 |
Product | chorismate mutase / prephenate dehydrogenase |
Protein accession | YP_001191941 |
Protein GI | 146304625 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0287] Prephenate dehydrogenase [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01791] chorismate mutase, archaeal type [TIGR01799] chorismate mutase domain of T-protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.786603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGAAC TCGACCAGCT AAGGGCTGAG ATCGACAGGG TTGATGAGGA GCTTTTTAAA CTATTTTTTA AACGCCTTGA ACTTGTTTCT GAAATTGGTC ATCTTAAAAA GAAGGAAGGT CTCCCTGTCA CCGACGAGAG GAGGGAGAGT GAGGTCAGGG AGAGATGGAG AAGCTTAGCT AGAGCCTATG GAATTCCCGA AACCCTGGCG GATAACCTCC TCTCGACCAT GTTCTCTGTA GCCAAGATGA GGGAGGTCAA TCCCTCGGAA AAGAGAAAAA TAACCCTCGT TGGTTACGGC GGGATGGCTA GGTCGCTGGC CTCCCTGTTC AAGCTCGCAA AGCACGAGGT TGTGATAACG GGAAGAAGCC AGGAGAAGTC CCAGAAGCTC GCGATAGATT TCAACTTCAC CTACATGCCC ATGCCACAAG CCCTTCAATG GGGCGAGATC GTGATTTTGG CCCTTCCACC TGAGGGTGTG TTCTCGGAGA ACGTCACCAG GTTTCTCCAC CTGTCCAAGG ACAGGGTGGT AATGGATATC CTCTCGAGCA AGACCAGATT CTTTGGGAAA CTCGAGGAAA TGTCCAGGCA GATGGGATTC AGGTTCGTGT CGACACACCC GCTCTTTGGT CCCTACCTGA ACCCTGTGGG AGAGAAGATA GTCCTGATCC CCTCTGAAAC CACGGGAGAC CTCGAGGAGA TATCGGAGTT CTGGAGGGGG GTAGGACTGA CCCCCCTAAT CACAGACGTA GATACACACG AGAAGTTAAT GGCCGTGGTT CAGGTTTTAC CCCACTTTTT CATCCTGGGC CTTTCCAGTA GCTTGGACCT CCTCTCCAGG GAGCTCAACG TCGACTTCTC CCAGTTTCAG ACAACCAACT TCAGGGAGAT ATACAAGATT GTGAGGAGGG TAAAGGAGTT GGAGCCAGTC ATACTGGAGA TACAGAGAAT GAACCCCTAC GCGGAGCAGG CGAGAAGGCT TGGACTTAGA GAGTTAAATA CTCTTTTCTC TACTCTTCAA GAGGAAAAGA AATGA
|
Protein sequence | MKELDQLRAE IDRVDEELFK LFFKRLELVS EIGHLKKKEG LPVTDERRES EVRERWRSLA RAYGIPETLA DNLLSTMFSV AKMREVNPSE KRKITLVGYG GMARSLASLF KLAKHEVVIT GRSQEKSQKL AIDFNFTYMP MPQALQWGEI VILALPPEGV FSENVTRFLH LSKDRVVMDI LSSKTRFFGK LEEMSRQMGF RFVSTHPLFG PYLNPVGEKI VLIPSETTGD LEEISEFWRG VGLTPLITDV DTHEKLMAVV QVLPHFFILG LSSSLDLLSR ELNVDFSQFQ TTNFREIYKI VRRVKELEPV ILEIQRMNPY AEQARRLGLR ELNTLFSTLQ EEKK
|
| |