Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0761 |
Symbol | |
ID | 5103450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 694051 |
End bp | 695190 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506666 |
Product | metallophosphoesterase |
Protein accession | YP_001190860 |
Protein GI | 146303544 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.059163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0154389 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATTCTCC ATATCTCCGA CACTCACTTG GGTAGTAGAA GATATAACAG GGACTCAAGG GAGCAGGACG TTTATGACGT CTTCTCCCAG TTAATTGACC TCGCAATAAG GGAGCACGTG AGAGCTATTG TTCATTCTGG AGACCTCTTC GATGTGTACA AGCCCGGGAA TAAATCCCTC AAGTTCTTTG TGGATAAGGT GAAACTCCTA AGGGATAAGG GCATAGATTT CATTAACATA CCGGGAGACC ACGATACGCC TAAGGTGAGA GACGAAATAT ACACCCAAAG ATTGCTGGGC GAGTCCCTAG GCTTGATCGA GATGTTAATG GGGGATCAGG ATCCAAGGTT CGTGGAAATA GATGACGGTG GGATAAGGAT AAGAGTGTAT GGAATTAGGA GCATGAGCAC CGTCTTCAGG GACAACCTTC TGAATCTCCT TGGCTCGCTT AAGCCGGAGG GAGAGAGAAA TGTCCTCATG CTCCATCAGG GCTTCAGGGA AATGTTGCCC TATGACAATG CCTGGCAACT GGAGATAGGT TCCCTTCCCA AGGGTTTTCA ATACTACGCT TGTGGCCACC TCCATTCGAG AGACGTAAGG GTGCTACCTT GGGGTGGGAT ACTTGCGGTA GCCGGTTCCC CTGAGATAAT AAGGGAGGAG GAAATTGAGG GCTGGAGGAA AAACGGTAAG GGAGGATATC TGGTAGACCT GAGCAAAAGG GAAGCTGAAA TACATCCACT TAACGTTGAT GTGAGGCCCC AGGAGGTAGT TAGGATAGAC ACTGCGAGCG TGGACCAGGA CATAGATAAC ATAAGGAAAA AACTGAGTGG GGGTAGGAAG CCAATCCTTC ACGTTATTTT GGAGGGTGAC TCCTCCAAGA GGAGTTACGC CATGAAGAGG TTGAGCCAAC TCTCTGAGAT TGCAGAATTT TACAGGATTT ACAAGGACGA GACAACAGAT TCGCAGTTAA GGGAAGTGAA GGCCAGTAAC AATGGCACAA TCTCGGAGCT AATAGCGGAG TACCTGAGAA AACAGGGTTA CAATGACGAG GAAGTTAAGC TCATCCTTGA GGTCATAGGC AAGTATGATT CAGATGAGGC TGATGAAATC CTCAAGAAGT TCGCGGAGAT GGAAAGATGA
|
Protein sequence | MILHISDTHL GSRRYNRDSR EQDVYDVFSQ LIDLAIREHV RAIVHSGDLF DVYKPGNKSL KFFVDKVKLL RDKGIDFINI PGDHDTPKVR DEIYTQRLLG ESLGLIEMLM GDQDPRFVEI DDGGIRIRVY GIRSMSTVFR DNLLNLLGSL KPEGERNVLM LHQGFREMLP YDNAWQLEIG SLPKGFQYYA CGHLHSRDVR VLPWGGILAV AGSPEIIREE EIEGWRKNGK GGYLVDLSKR EAEIHPLNVD VRPQEVVRID TASVDQDIDN IRKKLSGGRK PILHVILEGD SSKRSYAMKR LSQLSEIAEF YRIYKDETTD SQLREVKASN NGTISELIAE YLRKQGYNDE EVKLILEVIG KYDSDEADEI LKKFAEMER
|
| |