Gene Information |
Locus tag | Msed_1876 |
Symbol | |
ID | 5104144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1819129 |
End bp | 1820121 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640507762 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_001191940 |
Protein GI | 146304624 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
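
The coordinate and length fields above are related by simple arithmetic. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive (the listed values are consistent with this) and that the CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1819129, 1820121)
print(length, protein_length(length))  # prints: 993 330
```

Both results agree with the Gene Length and Protein Length fields of the record.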
| |
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCTCT ACATATTGAA GGGCAAGGAT CACTCCTCAC TAAAGGAGAA ACTTAAGTCT AGCTCAGCCT CATTCAAGTT CCTCAACCTG TACGGAAAGG AGCTCGCACT GGCCTGGCCC GACTCTGCAG TTGAGGGGAT AACGGATGAA TCCGTGGAGC TAGTGGTGAA GACCAAGAAG TCCTACATCC TGGCTGGAAA CGAATGGAAA AAGGATCCAA CAGTGGTGAA GGTGAAGGAC GTGGAGATAG GATCCAAGAG GGTTGTGGTC GCTGCCGGGC CATGTGCGGT GGAGTCCATG GAGCAGACCG AGACCGTGGC CAAGGCCGTG AAAAGGGCTG GTGCCTCACT ACTCAGGGGT GGAGCCTACA AGCCTAGGAC GAGCCCCTAC TCCTTCCAGG GACTGGGAGA GGAAGGGCTA AAGATACTCA GGAAGGCTGG CGACGAGACA GGGTTACCTG TGGTTTCAGA GATCCTTGAC GCGAGGGACG CGGGAGCCTT TGCAAAGTAT GCTGACATGG TCCAGATAGG CGCTAGGAAC TCGCAGAACT TCACCCTTTT GCGGGATGTG GGAAAGCTGG GCAAACCCGT CTTGCTAAAG AGAGGTCTAG GGAACACGGT GGAGGAACTA ATACAGTCTG CGGAATACGT AATGATGGAG GGGAACGGCA ACGTGGTCCT CTGCGAAAGG GGAATAAGAA CCTTTGAGAA GTCCACCAGG TTCACCCTAG ATATTGGAGG AATGGTTGCG GGGAAGCTAA TGACGCACCT CCCCTTCTGC GCGGATCCGA GTCATCCTGC GGGGAAGAGG GAACTGGTTC ACTCCCTTGC CCTAGCCTCT GTGGCAGCCG GGGCAGACAT GCTCCTTGTG GAAGTTCATC CCAGGCCTGA GGTGGCACTG AGCGACTCTG AACAGCAACT GACCCCGGAG TCCTTTGAAT TATTGATGGA GAGAGTCAAG GCATTGGCTT CGGTTCTAGG TAGATCCGCA TGA
|
Protein sequence | MILYILKGKD HSSLKEKLKS SSASFKFLNL YGKELALAWP DSAVEGITDE SVELVVKTKK SYILAGNEWK KDPTVVKVKD VEIGSKRVVV AAGPCAVESM EQTETVAKAV KRAGASLLRG GAYKPRTSPY SFQGLGEEGL KILRKAGDET GLPVVSEILD ARDAGAFAKY ADMVQIGARN SQNFTLLRDV GKLGKPVLLK RGLGNTVEEL IQSAEYVMME GNGNVVLCER GIRTFEKSTR FTLDIGGMVA GKLMTHLPFC ADPSHPAGKR ELVHSLALAS VAAGADMLLV EVHPRPEVAL SDSEQQLTPE SFELLMERVK ALASVLGRSA
|
| |
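
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the gene sequence. It rests on several assumptions: the gene sequence is already given in coding orientation (it begins with ATG even though the strand is listed as "-"), the first codon is ATG so that the alternative start codons permitted by translation table 11 do not matter, and the codon-to-amino-acid assignments of table 11 match the standard code. The helper names `ungroup`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional NCBI codon order: TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a (possibly grouped) nucleotide sequence."""
    seq = ungroup(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = ungroup(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed in this record.
print(translate("ATGATCCTCT ACATATTGAA GGGCAAGGAT"))  # prints: MILYILKGKD
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be rechecked in the same way.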