Gene Information |
Locus tag | Msed_1523 |
Symbol | |
ID | 5104051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1484844 |
End bp | 1485986 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507410 |
Product | hypothetical protein |
Protein accession | YP_001191603 |
Protein GI | 146304287 |
COG category | [S] Function unknown |
COG ID | [COG1415] Uncharacterized conserved protein |
TIGRFAM ID | |
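The coordinate and length fields in this record are related by simple arithmetic, sketched below. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1484844, 1485986)
print(length, protein_length(length))  # 1143 380
```

Under these assumptions the computed values agree with the Gene Length (1143 bp) and Protein Length (380 aa) fields.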
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.427834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.251629 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAGATCA CGGGCATCAG CGACCTACCG CTTCACTATG GCAGAGTTCC GCAATGGCTT ATCCCTATCA TGAAGAGGTT GTCGAGAGCC ATAGTGGACG TGATGCTTCT AGAGTGGGGT CCAGATAAGG TGATAGAGAG GTTATCTAAT CCTCTCTGGT TTCAGGGTTT TAACAACGTT ATAGGGATGG ACTGGGACTC CTCAGGTTCA ACAACTGTAA CGCTAGGTAT CTTGAAGGAG GTGATAAACC CAGTACAAGA CGGATTAGCC GTGACTGGCG GAAAGGGAAA GAACTCGCTG AATGTACCCA AGGAACTGGA GGCTCTTCCA TCCAATTTCA ACGTTGACGC TAAGCGACTA TCCAGGATTA GTAAGCTTGT GGCTAAGACT GATACCACCC TTCTTCAAGA CGGCCACGAA CTATATCATC ACTCCCTCCT GGTTTCAGAG TCCGGTAAGT GGGCAATAAT TCAGCAGGGG ATGAATCCCA CCACGAGGTT TGCTAGGAGA TATCACTGGA GCTCCATTAA AGACCCGGTA AATGAGAAAC GTGCGGGATT AGCTGGACGA AAGGAGAAGG CGGTCCTTAA CGTCCACGAG TCTGTGAACG CCAAGAGGGT GATAATGGAT CTGCTAAGAG AGGATCCTTC CAAGATAGAG AAACAATACC TCAGGTCAAT GGCCTTGATC AGGGGCACCT CTCTGGACTC TTGGATCAGT CTTGGAACCG TGGGTGCCAT TTCGTCTGAG GCAAAGATGG TCTACATGAA GCCCGTGGAT GTGAGGAGAG TGGTGAAGAC ACTCTCGGAA GTAAGGGAGA AGTCACCTAC CAGTTTAGAG GAAGCTCTCC TTCTAGGTAT AGGTCCCTCA ACCATGAGGG CGTTAAGCTT GATATCTGAC CTCATTTATA ACGAACCTCC CTCATACCAG GATCCCGTAA ACGTGCCCTA CGATCCCTTC AAGTACGCCT TCGCGATAGG TGGGAAGGAT GGAATCCCCT TCCCTGTGCA CAAGGAGATC GCCTTTGAGG TCATTCATAC CCTTGAAGAG TTTGCTATGA AAGCCAAGCT AGAGAAAAAA GATAAAGCCG TTGCATTAAA TAAGCTGAGG GAAATGAAAC TTGGAGTTAA GGAAGGGACT TGA
Protein sequence | MEITGISDLP LHYGRVPQWL IPIMKRLSRA IVDVMLLEWG PDKVIERLSN PLWFQGFNNV IGMDWDSSGS TTVTLGILKE VINPVQDGLA VTGGKGKNSL NVPKELEALP SNFNVDAKRL SRISKLVAKT DTTLLQDGHE LYHHSLLVSE SGKWAIIQQG MNPTTRFARR YHWSSIKDPV NEKRAGLAGR KEKAVLNVHE SVNAKRVIMD LLREDPSKIE KQYLRSMALI RGTSLDSWIS LGTVGAISSE AKMVYMKPVD VRRVVKTLSE VREKSPTSLE EALLLGIGPS TMRALSLISD LIYNEPPSYQ DPVNVPYDPF KYAFAIGGKD GIPFPVHKEI AFEVIHTLEE FAMKAKLEKK DKAVALNKLR EMKLGVKEGT
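The sequences in this record are printed in 10-character blocks separated by spaces, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and the example runs only on the first three blocks of the gene sequence, so it is not a check of the 49% GC content reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence in this record, as a short example.
fragment = clean_sequence("ATGGAGATCA CGGGCATCAG CGACCTACCG")
print(len(fragment), fragment[:3])  # 30 ATG
print(gc_percent(fragment))
```

Passing the complete Gene sequence field through `clean_sequence` should yield a string whose length matches the Gene Length field, which is a quick way to detect truncated copies.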