Gene Msed_0215 details


Gene Information

Locus tag: Msed_0215
Symbol:
ID: 5104081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 176688
End bp: 177698
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 48%
IMG OID: 640506120
Product: delta-aminolevulinic acid dehydratase
Protein accession: YP_001190316
Protein GI: 146303000
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0113] Delta-aminolevulinic acid dehydratase
TIGRFAM ID:
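
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 176688
end_bp = 177698
gene_length_bp = 1011
protein_length_aa = 336


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)                  # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)     # True
```

Under these assumptions the reported gene length (1011 bp) and protein length (336 aa) are consistent with the reported coordinates.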


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000475247
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0306727
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGGAT ATCCCAAGAT AAGACCAAGG AGATTAAGAC AGAACAAGAA TATAAGGGAC 
GCAGTAGCGG AGACGAAACT GACTCATGAT AACCTAATCT TACCCATCTT TGTTAAGGAG
GGTATATCTA AGCCCGAGGA AATTTCCTCA ATGCCTGACG TTTATAGGTA CCCTGTGGGT
GATCCCCTAA TTAAGTTCGT AGAGGGGAAT TATTCCAAGG GAATAAGGAA GGTTATCCTG
TTCGGGATTC CATCCTTCAA GGATAACATA GCGAGCTCCG CATATCAGAA GGATGGAGTA
ATTCAGAGGT CGTTAAAGCT ATTGAAGGAA ACATTTGGAG ACAAGATACT TCTCTTCGCA
GACGAATGCA CTGACGAGTA CACTAGTCAT GGACACTGTG GGATAGTGAA TTACAGGGGG
AAACAATATT ACATTGACAA CGATGAGAGC TTGAAGGTCC ACGCGAAGAT AGCCCTGTCT
CAAGCCGAGG CAGGTGCCGA CGTGATTGCA CCCTCTAGCA TGATGGATGG AGTTGTTGGC
GCCATTAGGG AAGAGCTTGA CAGAAATGGC TTTACTGATA CCCTTATCAT GTCTTATAGC
GTGAAGTACG CCTCAGTCTT CTATTCCCCG TTTAGAGAGG CAGCCAGCTC AGCCCCTGCA
TTTGGGGACA GGAAAAGCTA CCAGATGGAC CCACGAAACG CCAACGAGGC GATAAAGGAG
GCCAGATTGG ACTTAGAGGA GGGGGCTGAT ATACTTATGG TGAAACCGGC CCACACTTAC
CTAGACGTGA TAAGGCTGGT AAAGGAGACC TATCCCGAAT ATCCCCTAGC AGCATATCAT
GTTAGCGGAG AGTATTCCAT GATCAAGGCC GCGGCCATAA ACGGTTGGTT GAACGAGAAG
GTGGCCGTCC TCGAGATCAC TCACGCCATT AGGCGTGCGG GGGCTGATAT GATCCTGACC
TATTACGCTC CAAAACTGGC AGAGTGGATT TTGGAGGCGA GTCCGTTTTG A
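
The gene sequence is printed in blocks of ten bases, so it has to be cleaned before it can be measured. The following is a minimal sketch of how the length and GC percentage could be recomputed from such a listing. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its "GC content" value is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence listing in place of this first row.
first_row = "ATGGTGGGAT ATCCCAAGAT AAGACCAAGG AGATTAAGAC AGAACAAGAA TATAAGGGAC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Run on the complete listing, the result can be compared with the Gene Length and GC content fields of the record.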
 
Protein sequence
MVGYPKIRPR RLRQNKNIRD AVAETKLTHD NLILPIFVKE GISKPEEISS MPDVYRYPVG 
DPLIKFVEGN YSKGIRKVIL FGIPSFKDNI ASSAYQKDGV IQRSLKLLKE TFGDKILLFA
DECTDEYTSH GHCGIVNYRG KQYYIDNDES LKVHAKIALS QAEAGADVIA PSSMMDGVVG
AIREELDRNG FTDTLIMSYS VKYASVFYSP FREAASSAPA FGDRKSYQMD PRNANEAIKE
ARLDLEEGAD ILMVKPAHTY LDVIRLVKET YPEYPLAAYH VSGEYSMIKA AAINGWLNEK
VAVLEITHAI RRAGADMILT YYAPKLAEWI LEASPF
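
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below builds the codon table in the standard NCBI base order (T, C, A, G); translation table 11 uses the same amino acid assignments as the standard code and differs in which codons may act as start codons, which does not matter here because this gene begins with ATG. The function name `translate` is hypothetical, and stopping at the first stop codon is a simplifying assumption.

```python
from itertools import product

# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listing.
first_row = "ATGGTGGGAT ATCCCAAGAT AAGACCAAGG AGATTAAGAC AGAACAAGAA TATAAGGGAC"
print(translate(first_row))  # MVGYPKIRPRRLRQNKNIRD
```

The output matches the first twenty residues of the protein sequence listed above.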