Gene Information |
Locus tag | Msed_0728 |
Symbol | |
ID | 5103766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 665393 |
End bp | 666877 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506632 |
Product | peptidase M61 domain-containing protein |
Protein accession | YP_001190827 |
Protein GI | 146303511 |
COG category | [R] General function prediction only |
COG ID | [COG3975] Predicted protease with the C-terminal PDZ domain |
TIGRFAM ID | |
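
The coordinate and length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the stop codon counts toward the gene length but not the protein length. A minimal check under that assumption (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 665393
end_bp = 666877

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1485 bp
print(protein_length, "aa")  # 494 aa
```

Both results match the reported Gene Length (1485 bp) and Protein Length (494 aa).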
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.219972 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTTTTCA CTGTAAAGCC TAGGCCCAGA TATCTAGAAA TAGAGGCTCA TGGAAGAGAA GGAATTGTAA TATTTCCCAC GTACCTACCT GGTTCCTACG TAGTCAGGGA GCTAGAAAGA AACATTGTCG AGATCGATGG GGTTAGGATT TCAAAGAATC GTTTTTACGT CAAGGAAAAC TTTAGATATC TAGTTTACGC ATCCAGTAAG GACCAGAGAG AGGCCATCTC CTCCACCGAT TATCTTTTTA TAAATCCCCC TGCCGTGTTT CCCTTTCAGG ATTGGAACGA GAGGTACTGT GTCAAGTTGG ACTTGAGATG GCCCGTTAGC ACGACGCTGA GGAGAGAAGG GGAGTATCTA TGTGCAGATG ACTATGAAAC TTTTGTGGAC TCGCCAATCC AAGCAAGTCC AAACCTGAAA ACCCTTGTTA TAGACGACCA CCATGAGATA ACCACAGTAG ATGATCTTGA TCTGACGGGA GTAGCCATGG CAATCAAGGA AATAGACAAG GAGATGGGTA CACCAGATAG GTACACCTTC TTCTTCAGGA GGTCAGACAG AAACTACGGA GGCATTGAGC ACTATAACTC CTCCGCAATC GTGGTAAACT GGGAAAGATC TGACCTAGTC ATGCTAATGG CTCACGAGTA TTTTCATAGC TGGAACGTGA AAAGGTACAG GCCAAAGGAT CTCGAACTAG ACTTGGAAAA GGAGACCCAC TCGGATCTAC TTTGGTTTGC TGAGGGGGTG ACGGATTACG TGGCTTGGCT AGCTTCCACG AGAAGCGGTG CAGTGAAGAG TGAGGACACT GGAAAATACA TGGCTAACGC TATCTCCAAG TTCACCTTTC CTGGGGCAAA GAGAATGTCG TTGGCTGAGT CCTCTAGAAC CACATGGATA AAGTATTACA GGCAAGACGA GAACTTCCTG AATTCCTCAG TTTCCTATTA TGACGGAGGA CTATTACTGG GGCTGATACT TGACGCGAGG CTTAGGAGAA GCGGTGAGAA CATATTTAGC ATATTCAAGA ACATACCTTT CAGGTATACG TTTAGCGATA TTGACAATTA CCTGAAATCA AGGGGCATAG ATGACCTAGA GGAGATGGCG TACTCCCCGT CCTCTATCCT CCTAGAGAAA CTTAAGGAGG TAGCCGAGCT TCAATTTCTG GATGGGGGAA ATCCTTACCT CGGAATCATG ATGGACGGTA ACAAAGTGAC TTATGTGGAG GACGGTTCCC CTGCTGATAT GGCGGGCTTG ATGCCCCAAG ATATCATTCT AGCAACGGAC AACGTGGTAA GACCTGTTGA AGTGAAGGCG CAGGTGGAAC TCCTAGTAAA TAGGGAGGGA AGGGTGAAGA GGGTGTTGGT CACGGCAGGA AGGAACCCAG GACACAGGGT GAAGTTCACG ATCAAGGGGG ACATTGCGAA GCAGTTGTTG GGGATGGATT CCTTGGATGG AACTTCATCC ATAAGCTTGA TCTAG
Protein sequence | MLFTVKPRPR YLEIEAHGRE GIVIFPTYLP GSYVVRELER NIVEIDGVRI SKNRFYVKEN FRYLVYASSK DQREAISSTD YLFINPPAVF PFQDWNERYC VKLDLRWPVS TTLRREGEYL CADDYETFVD SPIQASPNLK TLVIDDHHEI TTVDDLDLTG VAMAIKEIDK EMGTPDRYTF FFRRSDRNYG GIEHYNSSAI VVNWERSDLV MLMAHEYFHS WNVKRYRPKD LELDLEKETH SDLLWFAEGV TDYVAWLAST RSGAVKSEDT GKYMANAISK FTFPGAKRMS LAESSRTTWI KYYRQDENFL NSSVSYYDGG LLLGLILDAR LRRSGENIFS IFKNIPFRYT FSDIDNYLKS RGIDDLEEMA YSPSSILLEK LKEVAELQFL DGGNPYLGIM MDGNKVTYVE DGSPADMAGL MPQDIILATD NVVRPVEVKA QVELLVNREG RVKRVLVTAG RNPGHRVKFT IKGDIAKQLL GMDSLDGTSS ISLI
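
The relationship between the two sequences can be spot-checked by translating the gene sequence codon by codon. The sketch below is an illustration, not the database's own pipeline. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code for elongation codons, and it strips the 10-base spacing used in the record. `translate`, `CODON_TABLE`, `head`, and `tail` are hypothetical names introduced here.

```python
# Standard genetic code, listed in the conventional TCAG base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence above.
head = "ATGCTTTTCA CTGTAAAGCC TAGGCCCAGA"
tail = "ATAAGCTTGA TCTAG"

print(translate(head))  # MLFTVKPRPR
print(translate(tail))  # ISLI
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to the same function would be the natural way to check the complete 494-residue product.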