Gene Information |
Locus tag | Msed_2063 |
Symbol | |
ID | 5105043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1982017 |
End bp | 1982928 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640507953 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_001192127 |
Protein GI | 146304811 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | |
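The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1982017
END_BP = 1982928


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced, complete CDS (the record lists
    'Is gene spliced | No').
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, expected_protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (912 bp) and Protein Length (303 aa) fields.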
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTGTATC AGCTTGAACT TATTATGCAC CACATTCTAG GTGTTTACAT GAAACAGATT ATTGTTGCTA ACGTTACACC CTTCGATGAG AAAGGGAACC TGGACCTAGA GGGGTTAAGG ACCCTTTACA ATTTTGACCT GAGCAGAGGC GCAACGGGTT TCTGGGTGAT GGGTACCACT GGAGAATGTA AGATGCTCAC TTATTCCGAG AAGCTAGCTG TGGCTAAGGC GTCAATAGAG ACCCTAGGGA ACAAGGCTAT CATTGGAATA AACGAGGAGT CCACTGACAA CGCAGTGAAG CTGGCTAAGG AGATCGTCGA TATGGGTGCC TCAAAGATAT TCTCGCTTCC ACCCATCTAT CACAAGCCAT CAGAACTTGG ACTTTTCAAG TTCTTTGAAT CCATTTCCAA GATAGGTATT CCAGTCTACG TTTATAATAT ACCCAGTTAT GTTGGATATA ATATAGACCT CAACCTCACT GGGAAAATGG CAGAGGAGGG CATTATACAG GGAATGAAAT ACACCACCAA CGACTTAGTG TCATTTCACG AGTATACAAG ATTAAAACAG GACCATAAGG AATTTGAGAT ACTCATGGGT ACCGAACACC TCATCCTCCC ATCTCTGATG TATGGTGGTG ACGGGGTAGT AACAGCAGTA GCCAACTTCG CTCCAGAATT TGTGAAGAAC ATTTTTGATT CCTTCGAAAA GGGTGATATC TTGAAGGCCA TGGAGGACCA GTATAAGGTC ATAAAGTTGG CATCTGTAGT TTCTGGAGAA GACTACCCTG CAGGAGTTAA GATTGCCCTT AGGTATAGGG GTATATACGT GGGAAGAGTT AGGGAACCAC TCCAGGAAGA CATAAATCGT GAGGGGGTAA TATACGCAAC GTTGAAGGAG TTCGGACTCT AA
Protein sequence | MLYQLELIMH HILGVYMKQI IVANVTPFDE KGNLDLEGLR TLYNFDLSRG ATGFWVMGTT GECKMLTYSE KLAVAKASIE TLGNKAIIGI NEESTDNAVK LAKEIVDMGA SKIFSLPPIY HKPSELGLFK FFESISKIGI PVYVYNIPSY VGYNIDLNLT GKMAEEGIIQ GMKYTTNDLV SFHEYTRLKQ DHKEFEILMG TEHLILPSLM YGGDGVVTAV ANFAPEFVKN IFDSFEKGDI LKAMEDQYKV IKLASVVSGE DYPAGVKIAL RYRGIYVGRV REPLQEDINR EGVIYATLKE FGL
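The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that display formatting and compute a GC fraction before further analysis. The helper names are hypothetical, and the demo uses only the first two blocks of the gene sequence, so its result is not the gene-wide GC content reported in the table.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace separating the display blocks and upper-case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence shown above, used as a short demo.
demo = clean_sequence("ATGCTGTATC AGCTTGAACT")
print(len(demo), gc_fraction(demo))
```

Running the same two functions on the full Gene sequence field would allow a comparison with the GC content and Gene Length fields of the record.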