Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0232 |
Symbol | |
ID | 5104098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 192050 |
End bp | 193858 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506137 |
Product | hypothetical protein |
Protein accession | YP_001190333 |
Protein GI | 146303017 |
COG category | [S] Function unknown |
COG ID | [COG2433] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0116565 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGTAA TGGGTATCGA CATAGAGAGG GGGTCGCCCA ATTCCACGGA ACAGCCTAGA TACTCAGTGG TAATTCTAGA CGAAAACGGA GAAACAGTGG TTAAGGTCGA AGATGTAACT AGGAGTAGGT TAGTAAGACT GGCCTGGGAG TACGACGTCT CACTTCTTGG AACTGACAAC ATTTACGAGC TAGGTAGCAA CGACAAGGAA GTAATATCGC TTCTATCCCT GTTACCCGAG AAACTGGAGG TAGTTCAGGT TACAGTAAAG AACGGCGTCT TCCTCGACCT AAAGGATGTA GCTAAGGAGT ACGGTATTGA GATTCAGGGG AAGCCGACTC CGTCCAGGAC TGCCTTTATC GTGGCGACCC TAGCTCTCAA GGGGGCAGGG ACCAAGATAA AGTTCGTTGA AAACAGGACC AAAATCATCA TTTCCAAGGG GAGAAGATCA GGACCAGGGG GAATGAGCTC CAATAGATAC AAGAGGCATC TGAGGGGATT GGTGCTCAGG GTTTTTAGAA GAGTGAAGGA AGAACTGGAC AGACACAACT TTGACTATGA CGTGGTTGTC AGGAGAACTA AGGCTGGAAT GGAAGGGGCT ATGTTCATTG TTTATGCCCC AAGGGAGAGC CTATACGGTC TAGTCAAGAA AATGAGTGGA CACGACGTTA ATCTCGAGAT CAGGGCCTAC TATAGGGATA GAATTGAGTT TGTGGACACT AAGAGGGTTT CCCAGAGACC TGTTATAGTA GGGTTAGATC CAGGTCTGGA AGTTGGAATC TCGATCCTCG ACATGTACGG AAATCCTGTA CTCCTCACTA CCAAGAGGGG AATTGACAGG GAGTCTGTGA TAGAGCTCGT TCTAGAGAAG GGGACCCCAG CCCTCATTGC CACAGACGTT AATCCAGTGC CAGACACGGT AAAGAAGATG AGCGCAATTC TGAAGGCTAG ACTCTACGTT CCTGAGAGGT CGCTTTCGGT GGACGAAAAA CAAGCTCTCC TTGACGAATA TTCCACGAAG TTTGGAATTC ACGTAAGCGA CCCGCACATA AGGGATTCCC TAGCTGCTGC CATTGTTGCG TATAGGGACG TAGAGAGGAA ACTCAGGCAG GCTGAGGGCA TGATAGGTAG GTTTGGAATA GACATAGACA GAAACAACGT GTTCAGGTGC GTTGTAAACG GGGGAACCAT TGCCGAATGC ATTGAGAACG AGATAGAGAA GAAGATATCT GTTCCGCAAA ACGCCGGTAT AGTAAAACAG GAAGTAAAGA CTGAACACAA CGAGAAGTTG GCTGAGGAGA ACACATTGCT CAAGCAGGAG TTAATCAGAC TCAACAGGAC AGTGTCTAGG TTAATTCATG AAAAGGAGAT GCTTGAGAGG AGGGTCGAGG AAATAAAGAG GCTTTACAAT GCTGAACTGG ACAGGGATAG GCGAGTGGAG GAACTGAAAA GGATACTGGA GCAGAAGAAC AAGGAGATAA TCAAATTGAA GGAACTATCT CAGGCAGAAT CGGAATCACT GGCTAAGCTT TCCTCAATTA TCGAGAAACT GGTGAAGAAT GAGGTCACAG TGGTGAGGGG ATACCTGAAG GGGTTGGAGG TTAGGGATGG CCAACTGTAT TTTGGAGAAT GGAGAATAAG CAACGATCTG GCAGAGTACG TGGGGAGAGA TTTCGCCCTA ATTGACGAGC GCCTCATTAA GGATCTAAAC CTTCTTAAGA AGGAGAAGGA AATAAGCCGT GAAATGAGTG AAGACCTGCT AAAAAGATTG GTCGAAGAAT ACAGATCTTC AAGGTCTAGA ATAGCATAA
|
Protein sequence | MRVMGIDIER GSPNSTEQPR YSVVILDENG ETVVKVEDVT RSRLVRLAWE YDVSLLGTDN IYELGSNDKE VISLLSLLPE KLEVVQVTVK NGVFLDLKDV AKEYGIEIQG KPTPSRTAFI VATLALKGAG TKIKFVENRT KIIISKGRRS GPGGMSSNRY KRHLRGLVLR VFRRVKEELD RHNFDYDVVV RRTKAGMEGA MFIVYAPRES LYGLVKKMSG HDVNLEIRAY YRDRIEFVDT KRVSQRPVIV GLDPGLEVGI SILDMYGNPV LLTTKRGIDR ESVIELVLEK GTPALIATDV NPVPDTVKKM SAILKARLYV PERSLSVDEK QALLDEYSTK FGIHVSDPHI RDSLAAAIVA YRDVERKLRQ AEGMIGRFGI DIDRNNVFRC VVNGGTIAEC IENEIEKKIS VPQNAGIVKQ EVKTEHNEKL AEENTLLKQE LIRLNRTVSR LIHEKEMLER RVEEIKRLYN AELDRDRRVE ELKRILEQKN KEIIKLKELS QAESESLAKL SSIIEKLVKN EVTVVRGYLK GLEVRDGQLY FGEWRISNDL AEYVGRDFAL IDERLIKDLN LLKKEKEISR EMSEDLLKRL VEEYRSSRSR IA
|
| |