Gene Information |
Locus tag | Msed_0879 |
Symbol | |
ID | 5103525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 812318 |
End bp | 813592 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640506782 |
Product | hypothetical protein |
Protein accession | YP_001190975 |
Protein GI | 146303659 |
COG category | [S] Function unknown |
COG ID | [COG2308] Uncharacterized conserved protein |
TIGRFAM ID | |
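The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both are assumptions, not statements from the record, although they are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 812318
end_bp = 813592
gene_length_bp = 1275
protein_length_aa = 424

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so the protein has one
# fewer residue than the CDS has codons.
codons = span_bp // 3
residues = codons - 1

print(span_bp, codons, residues)  # prints: 1275 425 424
```

Under these assumptions the computed span matches the listed gene length, and the residue count matches the listed protein length.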
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.379593 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCAA TAAATCGCAG ACTAATGGAT AACATTACCG TCCTAGAGAA TAAAATTTTT GATTTAAGCA GGTTGGTGAA CTTACAAGCT TACATGGAAG GCTTCACGTT CTATACTTCT TCAAGGTACA GGGGTATACC CATCGACGTT ATTCCTAGAG TTATTAGTTC AGATTTTTAC TCTAGGATAT CCGATTATCT AGTTAAAAGG AACCTAGTTC TTAACCACTT GGTCAAGGAA ATTTACCAGG AGAAATCCGT GCACATACCT GATTGGATAG TCAAGACTAC ACCCTTTTTC AAACCGGAAA TGATGGACTT CTCCCCACCC AAGGGAATAT ACATCTATGT GAATGGTGCT GATATTGTTA GAGTAAATGG TGAACCCTAC ATTCTTGAGG ACAACGTAAG GGTGCCCTCA GGAATAGCCT ATTCTTATAG GGCATTCGAA TACGTTCATC GATTTTTACC TGAACTATCT CAGGGTTACA GCGTGGAGGA ACCATCGGGT TTAGAATATC TTTACGATAC ACTTCGCTAC GCGAGTGGAA GCAAGGATCC CGTAATCGTG CTTTTAACGG ACGGTCCCCT AAACTCCGCT TACTTTGAAC ACAGGTTCAT TTCTGACAAA CTCGGGTTTG TGTTGGCAGA GCCCAAGGAT ATACAGGTGA ACCAGGGAGA AGTTGTGGTA AAGACGTTAG ATGAGGGGGA AGTTCACGTT GACATAATAT ACAGGAGAAT TGAGGACCTT GAGTTGCTCA CTCCAGATTT GATGAGGGCA TATCTACGCG GGTGGGTTAC CATTGCCAAT GCCCCAGGCG TCGGAGTAGC TGATGACAAG GCAACCTTCG TTTGGATACC ATTCCTAGCT GAGAGATATG GAATTTCCCT TAAAGAGGTT ACACAGCCCT TTACCATATG CCTATATGAA AGGGAGAACC TTCAAAGGGT GATTAATAAC CCCTCGAGTT ACGTGATAAA GAAGAGGGAA GGCTATGGAG GCATTGGTCT CTCCATAATG AAGGATGAGA ACGCCAGTGT CCTAAAGGAG TTGGTAAAGG AGTATGAGAA CTTCATAGCG CAAGAGGTTC TCGATTTCGA CACCGTGGTC TCTGCGATAA ATGACTCGTT TTACGAGACT TTTGCAGATT TCCGCTTCTT CACATATTAC GATAGGGTGG CCACAGCAGT TTTAAGTCGA GTCGGTGTGG TTGGAAGTAG GGTAACAAAT AACTCTTCTG GAGGGATGGT GAAACCGGTG TGGATTACGA GGTAA
Protein sequence | MDPINRRLMD NITVLENKIF DLSRLVNLQA YMEGFTFYTS SRYRGIPIDV IPRVISSDFY SRISDYLVKR NLVLNHLVKE IYQEKSVHIP DWIVKTTPFF KPEMMDFSPP KGIYIYVNGA DIVRVNGEPY ILEDNVRVPS GIAYSYRAFE YVHRFLPELS QGYSVEEPSG LEYLYDTLRY ASGSKDPVIV LLTDGPLNSA YFEHRFISDK LGFVLAEPKD IQVNQGEVVV KTLDEGEVHV DIIYRRIEDL ELLTPDLMRA YLRGWVTIAN APGVGVADDK ATFVWIPFLA ERYGISLKEV TQPFTICLYE RENLQRVINN PSSYVIKKRE GYGGIGLSIM KDENASVLKE LVKEYENFIA QEVLDFDTVV SAINDSFYET FADFRFFTYY DRVATAVLSR VGVVGSRVTN NSSGGMVKPV WITR
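As a consistency check between the two sequences above, the gene sequence can be translated codon by codon and compared with the protein sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
BASES = "TCAG"
# Amino-acid assignments in NCBI order (first, second, third base each cycling T, C, A, G).
# Table 11 uses the same assignments as the standard code; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base groups in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk over complete codons only; any trailing partial codon is ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("ATGGATCCAA TAAATCGCAG ACTAATGGAT"))  # prints: MDPINRRLMD
# Last 15 bases of the gene sequence, which fall on a codon boundary.
print(translate("TGGATTACGA GGTAA"))  # prints: WITR
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `translate` would extend the same check to all 424 residues.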