Gene Information |
Locus tag | Msed_0215 |
Symbol | |
ID | 5104081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 176688 |
End bp | 177698 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506120 |
Product | delta-aminolevulinic acid dehydratase |
Protein accession | YP_001190316 |
Protein GI | 146303000 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0113] Delta-aminolevulinic acid dehydratase |
TIGRFAM ID | |
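The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is a minimal check, assuming the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both assumptions are consistent with the values listed above but are not stated by the record itself. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 176688
END_BP = 177698


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1011 336
```

Both results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields.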
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000475247 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0306727 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGGAT ATCCCAAGAT AAGACCAAGG AGATTAAGAC AGAACAAGAA TATAAGGGAC GCAGTAGCGG AGACGAAACT GACTCATGAT AACCTAATCT TACCCATCTT TGTTAAGGAG GGTATATCTA AGCCCGAGGA AATTTCCTCA ATGCCTGACG TTTATAGGTA CCCTGTGGGT GATCCCCTAA TTAAGTTCGT AGAGGGGAAT TATTCCAAGG GAATAAGGAA GGTTATCCTG TTCGGGATTC CATCCTTCAA GGATAACATA GCGAGCTCCG CATATCAGAA GGATGGAGTA ATTCAGAGGT CGTTAAAGCT ATTGAAGGAA ACATTTGGAG ACAAGATACT TCTCTTCGCA GACGAATGCA CTGACGAGTA CACTAGTCAT GGACACTGTG GGATAGTGAA TTACAGGGGG AAACAATATT ACATTGACAA CGATGAGAGC TTGAAGGTCC ACGCGAAGAT AGCCCTGTCT CAAGCCGAGG CAGGTGCCGA CGTGATTGCA CCCTCTAGCA TGATGGATGG AGTTGTTGGC GCCATTAGGG AAGAGCTTGA CAGAAATGGC TTTACTGATA CCCTTATCAT GTCTTATAGC GTGAAGTACG CCTCAGTCTT CTATTCCCCG TTTAGAGAGG CAGCCAGCTC AGCCCCTGCA TTTGGGGACA GGAAAAGCTA CCAGATGGAC CCACGAAACG CCAACGAGGC GATAAAGGAG GCCAGATTGG ACTTAGAGGA GGGGGCTGAT ATACTTATGG TGAAACCGGC CCACACTTAC CTAGACGTGA TAAGGCTGGT AAAGGAGACC TATCCCGAAT ATCCCCTAGC AGCATATCAT GTTAGCGGAG AGTATTCCAT GATCAAGGCC GCGGCCATAA ACGGTTGGTT GAACGAGAAG GTGGCCGTCC TCGAGATCAC TCACGCCATT AGGCGTGCGG GGGCTGATAT GATCCTGACC TATTACGCTC CAAAACTGGC AGAGTGGATT TTGGAGGCGA GTCCGTTTTG A
|
Protein sequence | MVGYPKIRPR RLRQNKNIRD AVAETKLTHD NLILPIFVKE GISKPEEISS MPDVYRYPVG DPLIKFVEGN YSKGIRKVIL FGIPSFKDNI ASSAYQKDGV IQRSLKLLKE TFGDKILLFA DECTDEYTSH GHCGIVNYRG KQYYIDNDES LKVHAKIALS QAEAGADVIA PSSMMDGVVG AIREELDRNG FTDTLIMSYS VKYASVFYSP FREAASSAPA FGDRKSYQMD PRNANEAIKE ARLDLEEGAD ILMVKPAHTY LDVIRLVKET YPEYPLAAYH VSGEYSMIKA AAINGWLNEK VAVLEITHAI RRAGADMILT YYAPKLAEWI LEASPF
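The gene and protein sequences above can be related with a short translation routine. The sketch below is a minimal example, not the database's own pipeline: it strips the spaces used to group the sequence into blocks of ten, translates codon by codon until the first stop codon, and computes GC content. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons). The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
# Codon table in the conventional TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


# First two blocks of the gene sequence above; the trailing partial
# codon is ignored.
print(translate("ATGGTGGGAT ATCCCAAGAT"))  # prints: MVGYPK
```

Passing the full Gene sequence field to `translate` and `gc_percent` should allow the Protein sequence and GC content fields to be checked in the same way.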