Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0636 |
Symbol | |
ID | 5103796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 583136 |
End bp | 584227 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506540 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_001190735 |
Protein GI | 146303419 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.288673 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATTGA TATACCCCGA CTATAGCAAG AACCTATATT CCCTCGGATG CGGTATCGCC AAGTGGTTAG GTGTTGAGTT ACAATGTAAT ACCAGCTACT CCCTGACCGG GAAAAAGCTG GTCCTCCTCA TACTAGACGG TTTTGGATGG AACATCATGG AAAGCTCGCT GGGCGAAGTG AAGGAGGCTA CCAAGATACA TGGGGTTTTC CCCTCGACCA CCTCAGCCAC TCTAGCTTCA ATATTCACAG GTAAGACCCC GGCTGAACAC GGAATCCTTG GTTATAATAC CTACGTGAAG AGACTGGGGG GAATAGTAAA CGTCTTGAGG TACACCCACC CAACCCTAAA TGAGAGGGAC AGCCTCTCCG ATGGGCTTCC CTTTGAGAAA GCTTTCCCAG AGGCAAAGGG TTACCTCTCG CAAGTAAAGG AAGGGACAGC CTCTGTTCTA CCACAGGGAA TTGAAAATAC CCAGTTCACC ACCACGGTGC AGGGAACCAC GCAAGAGACC AAGACCTACC TGAACGTATG GGATGCCTAC GAGTCGCTTA AACAACTCAT GGATAAGGGG GCAAGGTTCA TTTATGCCTA TATCCCTGAC ATAGATTCCC TTGCCCACAA GTATGGTCCC TATGCAGACC CAGTTAAGCT CGCCACCAGA GAAATCTTCA TGAGATTTTA CTCCCTCCTT AAGGAAAGGA CCGACTACAC TTCCATCATA ACTGCGGATC ATGGACTTGT GGATACGACG GAGAGAATTG AGATCGATAA GGACCAAGAA CTCATGAACA TGCTGGAGAT ACCTCCCTAC GGGGACTCCA GGGCCCTCTT TCTGAGGTCT AGGTACGACC TCAAGGTCTT CCTAGAGAGT AGATATAACC TCAAGGTGTT TGACAGGGAT GAGACCCTTA AGCTCCTGGG AGGGGTAGAC AAGGTCCCAG AGAGTATGCC AGACTTCGTG GGCGTTCCCC TAGACTACTC GTCTTATTTC TTTAACTTCA GGGAGAAATC AAACTACACA AGACTTAAAG GCCATCACGG TGGCCTCCTA AGAGAAGAGT TGGAAGTTCC ATTGGTGATG ATCAATGGTT GA
|
Protein sequence | MSLIYPDYSK NLYSLGCGIA KWLGVELQCN TSYSLTGKKL VLLILDGFGW NIMESSLGEV KEATKIHGVF PSTTSATLAS IFTGKTPAEH GILGYNTYVK RLGGIVNVLR YTHPTLNERD SLSDGLPFEK AFPEAKGYLS QVKEGTASVL PQGIENTQFT TTVQGTTQET KTYLNVWDAY ESLKQLMDKG ARFIYAYIPD IDSLAHKYGP YADPVKLATR EIFMRFYSLL KERTDYTSII TADHGLVDTT ERIEIDKDQE LMNMLEIPPY GDSRALFLRS RYDLKVFLES RYNLKVFDRD ETLKLLGGVD KVPESMPDFV GVPLDYSSYF FNFREKSNYT RLKGHHGGLL REELEVPLVM ING
|
| |