Gene Information |
Locus tag | Msed_0060 |
Symbol | |
ID | 5104252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 57034 |
End bp | 58632 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640505958 |
Product | hypothetical protein |
Protein accession | YP_001190161 |
Protein GI | 146302845 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
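The coordinate and length fields above are consistent with one another, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon is a stop codon not counted in Protein Length. Both assumptions are inferred from the listed values, not stated in the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 57034
end_bp = 58632

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a single stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values are 1599 bp and 532 aa, matching the Gene Length and Protein Length fields.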
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.214565 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACCC TTCAAAGAGA ACAGTTAGTT CAGGCCGAGA TAAACGGGGA TTCTCAGGCC GTTTTGACCC AAATGAGGGT GATCAATGAG CAACTCAATA ACTTTGTTCA GAACGATAGG GTACTGGAAT CGATCTCCTT TAACGCAATA GCCATGGTGC GGGTTGGGTC ACCTTTCACG TATATCACCA ACAACGATGA GGCCAACGAG GAGGCCAGAT GTAACGGCAG AACCGAGGAT AACATGAGGG ATAGATTCAA CGCGGGGGAC AGGGAGTACA TGAAGCTCGC CTTCGTTTCC ATAGCCAGCG ACGATGACAT ATCTGCGCAA ATCGTGGGCA AGCCCGTCAA GGAAGCTCCC AAGCTCGTGT TCAGACAGGG CGTTATGGTA AAGCTCACGA ATCCCTCAGG AAAGGAAATT ATGGTATACG TGGGTGGAAT CTCGCCAGAG TGGAGAGGGG ATAGAGTCTC AACTTATCAG TACGGTGCAG AGAGGAGATT TAAGTTCAGT TCCGGGGTTA GGAATAGGAT CTCGGGCGTG TTGGGCAGGT TCAACATTGG CGTGCTCTTT GGCGCAGAGA GGGCATCTCA GGTATTTGTG AATCCTCCCT CACCCAAGTC GTCAAGCTAC TCCGCGTTCT TGCAGTCATA TAGGAGAGAT ATACTGGAGG ATAGCATCAA AAAAGGCCCT GGCCTATTTT ACAAAACTCC AACTTCGCAC GGAGGGTGTA ATGCATGTAC CTCTATAGCG TTATCTACTA GCTCTAACTT TCCAGATCTT CTCCCGGCTT TCTCCACTCC CTTTGTCCCA TTCATTGACC AGAGCTATTA CTTGATCCCT GGCTATCCTG AGAACCCCAT AGACGAGTAT TACCTTGGGT CTCCCGTCTA TGCTGGTAAC TCCAACTGCG ACACACCCAC CATCGTAGGT TTCATCTCCT CCCTTGTACC CTTTTCCCAG GCCCTTCAGG CTTACGCCCA ACTCCAGAAC AGGTCAGTCC AGTCTAGTAA TCAGCTAGAA AGTCAGCTGA ACCAGTTCCT CTCATCCTCC TCGGTCGCAT TCGTCTTCCC CCCACCAAGG AGCTGGAGCT ACGACGACGT AATACAGTGG TCCTACTCCT TGGGCTACTC GCAGGACTAC ATATCCACGG CCGTCAATTT CGCGGTGGTG TTGGGAATTA TCCTTTCCCA GTACCCTGAG GACGTCGTCG ACGACCTCGA GAAAGAGCTG TATAGGCACG AGTTCAGTTG TTCCAACCAG GACCAATGTT ACTCTGAGAT GAGCAAGATA ACGCAGGATA TCTACAACGT GCTCTCCCAG CAGGGTTCGG ACAAGGAACT TGAGGCATTA CGTGTATGTC ACGCGGAGGC CACCACGCTC GACGAGACCT GTTACTGGAA CTGTGTGGAT AGGGTCTACG AGTCCCTCAA GCGTCTCCCG CTCTTTGAAA GGCTACACAT ATCACCTTCG CCCGAGGAGT TAAGGTGGGA TGCGCAAACC CAATGCAAGG AGGCCTGCAT CAAGTTTAAC GAGAAGAGGT ACCTCGAATG TCTGAAACAG ATATATCCCT CCAGGGGCAC TCACGTGACG CCCCAGTGA
|
Protein sequence | METLQREQLV QAEINGDSQA VLTQMRVINE QLNNFVQNDR VLESISFNAI AMVRVGSPFT YITNNDEANE EARCNGRTED NMRDRFNAGD REYMKLAFVS IASDDDISAQ IVGKPVKEAP KLVFRQGVMV KLTNPSGKEI MVYVGGISPE WRGDRVSTYQ YGAERRFKFS SGVRNRISGV LGRFNIGVLF GAERASQVFV NPPSPKSSSY SAFLQSYRRD ILEDSIKKGP GLFYKTPTSH GGCNACTSIA LSTSSNFPDL LPAFSTPFVP FIDQSYYLIP GYPENPIDEY YLGSPVYAGN SNCDTPTIVG FISSLVPFSQ ALQAYAQLQN RSVQSSNQLE SQLNQFLSSS SVAFVFPPPR SWSYDDVIQW SYSLGYSQDY ISTAVNFAVV LGIILSQYPE DVVDDLEKEL YRHEFSCSNQ DQCYSEMSKI TQDIYNVLSQ QGSDKELEAL RVCHAEATTL DETCYWNCVD RVYESLKRLP LFERLHISPS PEELRWDAQT QCKEACIKFN EKRYLECLKQ IYPSRGTHVT PQ
|
| |
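The Gene sequence field is displayed in space-separated blocks of 10 bases, so the spacing has to be removed before the sequence is analysed. The following is a minimal sketch of how the GC content and the translation under table 11 could be recomputed from that field. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and only the first three blocks of the sequence are used to keep the example short. The sketch does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
# They are the same as in the standard code; table 11 differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field above.
excerpt = clean("ATGGAAACCC TTCAAAGAGA ACAGTTAGTT")
print(translate(excerpt))  # METLQREQLV
print(round(100 * gc_content(excerpt)), "% GC in this 30-base excerpt")
```

The ten residues produced from the excerpt match the start of the Protein sequence field. Passing the full Gene sequence value through the same functions would allow the listed GC content and protein sequence to be checked in the same way.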