Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1010 |
Symbol | |
ID | 5105609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 929560 |
End bp | 931158 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640506909 |
Product | phosphoesterase |
Protein accession | YP_001191102 |
Protein GI | 146303786 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3511] Phospholipase C |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00457541 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGGCTC TAACTTGGAT CGGAATAGTT CTCACGCTTC TCTCGGCTCT CTCTTTTCTT TCAATCGTGA GCTCTGGAGC GACTCAAGCC ACAGCCACAC CAATAAAACA TGTCATATTC ATAGAACTTG AGAACCACGC TTTCGACAGC ATTTACGGCA CTTATCCCTT TGGATACCCC GTGATCGTGA ACAATATCAC CATGTCAGTC ATGAGGCCCG TGAATTACAT TTACAACTTA TCCCTCCTAA ACACCCTGTC TCAGTCCCAC GGAAACGTAA CCTGGATCTC AGTTCCCGCC GGAAAGGGCT ACCTTCACCC CTATTACGCC AATTCAACCG TCCTAGTAAA TCCCAAGGAA GGTTACACCA ACTATCATGA GGACTGGAAT TGGGGGCAAA TGAACGGCTT CGTGAACGGA TCTGGCCCTC AGTCACTGGC TTACGTCTCA TATGAACAGG TACCGCTCCT CTGGGATTAC GCCGAGGAAT ACGTACTCTT TGATAACTAC TTCTCTCCCA CCCTATCCGT GACTGTACCC AACAGGATTG CATACATTAC AGGTTTTCCC ACTCAGGTTG AGAGCGACGC CCCTCAATTT GGGTTAATAC CTCTTAACGA GTCTATCCTT TACCAGCTCA CGGAGAACAA TGTAAGTTGG GGCTGGTACG AATACGGTTA CTCCAAGGAC TTTCAGATAC TATCCCCTGA TCTTTACCTT GGATACAACA ACACGGCACC CCTGCCCGTT AGCCTCTTGA AGGGAGCGAA TCAGTGGAAC TCGCACTATC ACGACCTTTC AGACTTTCTG GCTGAGGCTA GAAACGGGTC TCTTCCATCA GTCTCATACG TCATGTTCAC GGGTCCCATG GGGTATGACG ATCACGTGCC CGGTTACGAT ATGCATCCTC CCTACAATAC CACACTCGCT ATGCTCATGC TCTCCACAGT GATCAACGCC GTGATGACGG GGCCAGACTG GAACTCCACT GTGATTTTCA TCACCTTCGA CGAAGGCGGA GGATACTACG ATCCAGTCCC TCCACCAATA GTTAATGGGT TCGGTCTCGC CAATACTCCA ACAATATCCA AGATATTACC GGGTTACTTC ACCCTAGGGC AGAGGATCCC GCTCCTTATG GTTTCGCCCT ACTCCAAGGA GGGATTCGTG GACAACTACA CCGCTTCGGG CTACTCAATC CTTGCCTTCA TTGACTACAA CTGGCATCTT CCCTACCTGA ACCCCATAGT GAAGGAGTTC GGACCAGAGT CAATCCTTTA CGGGCTTAAC TTCACTGCTC CAAGGCCTCC CCTGGTCCTG ACCCCTGAGA ACTGGAGTTA TCCGGTTCCC CTACAGTATC CAATTCACTA CGGCTACGTG GCAACCATTA ACAATAACTA CAGCATCTAC AACGCGATCT ACCACGATAA GCAGATGGGC AACTACACGC CCCCGCAGTA CTTCCTTGAG GGCAACGTGG TGCAAGGCGG GGTTCAGGAA GCCACGGGCT CCTCGGCTGG TTTCCCAACC CTCCTCCTGT GGATTCCAGT CCTCCTCATC ATCATAGCCG TGGGAGTCCT CCTGGAGAGG CGTAAGTGA
|
Protein sequence | MKALTWIGIV LTLLSALSFL SIVSSGATQA TATPIKHVIF IELENHAFDS IYGTYPFGYP VIVNNITMSV MRPVNYIYNL SLLNTLSQSH GNVTWISVPA GKGYLHPYYA NSTVLVNPKE GYTNYHEDWN WGQMNGFVNG SGPQSLAYVS YEQVPLLWDY AEEYVLFDNY FSPTLSVTVP NRIAYITGFP TQVESDAPQF GLIPLNESIL YQLTENNVSW GWYEYGYSKD FQILSPDLYL GYNNTAPLPV SLLKGANQWN SHYHDLSDFL AEARNGSLPS VSYVMFTGPM GYDDHVPGYD MHPPYNTTLA MLMLSTVINA VMTGPDWNST VIFITFDEGG GYYDPVPPPI VNGFGLANTP TISKILPGYF TLGQRIPLLM VSPYSKEGFV DNYTASGYSI LAFIDYNWHL PYLNPIVKEF GPESILYGLN FTAPRPPLVL TPENWSYPVP LQYPIHYGYV ATINNNYSIY NAIYHDKQMG NYTPPQYFLE GNVVQGGVQE ATGSSAGFPT LLLWIPVLLI IIAVGVLLER RK
|
| |