Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1700 |
Symbol | |
ID | 5105346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1639743 |
End bp | 1640789 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640507594 |
Product | aspartate-semialdehyde dehydrogenase |
Protein accession | YP_001191779 |
Protein GI | 146304463 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0136] Aspartate-semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR00978] aspartate-semialdehyde dehydrogenase (non-peptidoglycan organisms) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.109656 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.328927 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAAC TGAAGGTATC TCTTCTGGGA GCCACGGGGA TGGTAGGGCA AAAGATGGTT AGGTTGCTCT CCTCTCATCC CTACATAGAA CTCACCAAGG TTAGCGCTTC CCCGGGCAAG ATAGGGAAAA GATACATTGA GGCAGTGAAG TGGGTTGAGG GTGGAGAGGT ACCAGAGCAG GCTAGGGACC TCAAGATAGT GTCTACCGAG CCAGAGGATC ACAAAGACGT AGACGTTGTC CTTTCGGCCT TACCCAATGA GCTAGCTGAG GGGATAGAGC TCAAACTTGT TAGGGAAGGG ATAACCGTGG TGTCGAATGC AAGTCCATTC AGAATGGACC CAGAGGTTCC ACTAATAAAT CCCGAGGTAA ACTGGGATCA TCTGAAGCTT CTCGAAACCC AAAGACAGAA GAGGGGTTGG AAGGGCCTTC TTGTAAAGAA CCCCAATTGT ACTGCTGCCA TAATGAGCAT GCCGATCAAA CCCCTTCTTA AGTACCGCTT AAATCACATG ATCATAACCA CCCTCCAGGC GGTAAGCGGT GCAGGATATA ACGGTCTTTC CTTCATGTCA ATCACAAACA ACGTTATACC CTTCATAAAG GGGGAGGAGG AAAAGATCCC TAAGGAATCC GGGAAGATGC TTGGGACACT AGTAAACGAC TCGATCCGCC ACGTGGAGCT CAAAGCATTG GTAACCTCCA CGAGGGTTCC GGTCAAGGTG GGACATATGG GGGTAATGTA CCTGTTCTTT GACTCCCCGG TTAATGCTGA GGAGGTTAAG AGGGATCTTT CCTCTTTCAA ATCCTTACCC CAAGAGAGGA ACTTACCCAC TGCTCCCAAG ACCCCAATCA GGGTACTAGA GGGGGAGGAT AGGCCTCAAC CTGAGATAGA CGTTAGTGCT GAGAGGGGAA TGGCCATTAG CGTGGGGAGA GTAAAGAACG AAAATGGGGC TCTCAGGATG GTTGTACTAG GGGATAACTT GGTTAGGGGC GCAGCAGGGA TAACCATTCT CACTTTGGAG GTTATGAAAG AGCTGGGCTA CGTATGA
|
Protein sequence | MDKLKVSLLG ATGMVGQKMV RLLSSHPYIE LTKVSASPGK IGKRYIEAVK WVEGGEVPEQ ARDLKIVSTE PEDHKDVDVV LSALPNELAE GIELKLVREG ITVVSNASPF RMDPEVPLIN PEVNWDHLKL LETQRQKRGW KGLLVKNPNC TAAIMSMPIK PLLKYRLNHM IITTLQAVSG AGYNGLSFMS ITNNVIPFIK GEEEKIPKES GKMLGTLVND SIRHVELKAL VTSTRVPVKV GHMGVMYLFF DSPVNAEEVK RDLSSFKSLP QERNLPTAPK TPIRVLEGED RPQPEIDVSA ERGMAISVGR VKNENGALRM VVLGDNLVRG AAGITILTLE VMKELGYV
|
| |