Gene Information |
Locus tag | Msed_1702 |
Symbol | |
ID | 5105348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1641429 |
End bp | 1642628 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507596 |
Product | hypothetical protein |
Protein accession | YP_001191781 |
Protein GI | 146304465 |
COG category | [S] Function unknown |
COG ID | [COG1602] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
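The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1641429
end_bp = 1642628


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len):
    """Number of residues, assuming the final codon is a stop codon (an assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1200 399
```

Under these assumptions the result agrees with the listed gene length of 1200 bp and protein length of 399 aa.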
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.456812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.715012 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGTGCA AGGGAACTAA GTTCTTGTGT GGCCTCTCCT CGTGCCCCAT CACCGAGAGA TTTAGGGCAG TAGTTAATGC CACGTCAAGG ATATCCCTGG ATAAGGGTGT TGTAGACGGA TCCACGCCTC CAAGCGCAGT AGTTGGAGAG AAGGGTTATC CAAGGGTATC TCTCAGTTTC AACCTTGCCC CTGGTGTAGT GGGCGATCAG GCCAGGGTTT ACGAAGATCC CGTGAACTGG TGGGGTAAGG CGACCATATA TGATATTATA AATTACCGCT CTTCTCTCAT TTCTAATTTC TCCTCTATTC AGGTAACGGA TGTGTGGAAA TTGTATGAAA GGGAGCTTTC CCTGGCCGTG GTGTCAGAGA GGCCTGTTCA ATCCGAGAGC AAGATCTCTG GTAAGCTTGA GGCCAAGCTG AGATTTGACG GTTACGTGTT ACCTCGAGGA CCCTCGGTTA AGGCTGAGGA GATCAGGGTG GTGGAGAACC CTAAAGTCCC TAGGATGCTT GAGAAACTGA TACAAGACGA CGTGAAGGCT ACAGAAGGAG TAGTTTCACT TTACGAAAGC GGGCAGGATA TATACAGGAT TATTGATGCC CTATCACTGG GATTGTTGGG CACCAGAAAG GGGAGGAAGC TAGTCCCCAC TAGATGGGCA ATTACGGCTG TGGATTCAAT CGTGGGGAAG GAACTTTATG ATCGTGTCGT GTCCCTCCCT GCAATAAACG AGGTGTTAGT GTTCTATCAA GGATACCTCG GAAATCACTT TCACGTGATC CTTTACCCAT CCTCCTACTC AATCTCATGG GTAGAGATCT GGCATCAGAT GGCCCTCTGG TCCAACGAGC TCGTGATAAC TGACCTTCAA GAAGACTACT GGGGAAACTA TGACACCCTC GACGGGGGAT ATATGGCAGC CAGAACATCG GTGCTCGAGT ATCTTAACTC GATCTCAAGG TCTGCGGGAG TTGTCATTGT TAGGGAGATC ACAAAGGATT ACTTTGCCCC ATTGGGAAAC TGGCACATCA GGGAGACTGT GAAAAGGGCT TTTCAGAACA GGATAGCTAA AACATCTAGC TTGGGCGAAG CCTTAGACCT TGTTCAGTCC AGGCTCAAGG AAAAAAGGGT AAACCTTAGG GAGATTAGAA CCATTAGGAA GATCCTCTCG CAGAGAAAGA TAGATGAGTT CTTTCAATAA
|
Protein sequence | MRCKGTKFLC GLSSCPITER FRAVVNATSR ISLDKGVVDG STPPSAVVGE KGYPRVSLSF NLAPGVVGDQ ARVYEDPVNW WGKATIYDII NYRSSLISNF SSIQVTDVWK LYERELSLAV VSERPVQSES KISGKLEAKL RFDGYVLPRG PSVKAEEIRV VENPKVPRML EKLIQDDVKA TEGVVSLYES GQDIYRIIDA LSLGLLGTRK GRKLVPTRWA ITAVDSIVGK ELYDRVVSLP AINEVLVFYQ GYLGNHFHVI LYPSSYSISW VEIWHQMALW SNELVITDLQ EDYWGNYDTL DGGYMAARTS VLEYLNSISR SAGVVIVREI TKDYFAPLGN WHIRETVKRA FQNRIAKTSS LGEALDLVQS RLKEKRVNLR EIRTIRKILS QRKIDEFFQ
|
| |
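The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG can serve as a start codon and is read as methionine at the first position. The sketch below illustrates this on the first 30 nucleotides of the gene sequence only. It is a simplified illustration, not the database's own pipeline: the function name `translate_cds` is hypothetical, and the set of start codons is a deliberately small subset of the initiators allowed by table 11.

```python
# Standard codon assignments, which table 11 shares for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)

# Assumption: only a subset of the table 11 start codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading a start codon at position 1 as M."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence above.
first_30_nt = "GTGAGGTGCA AGGGAACTAA GTTCTTGTGT"
print(translate_cds(first_30_nt))  # prints: MRCKGTKFLC
```

The output matches the first ten residues of the protein sequence listed above.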