Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0469 |
Symbol | |
ID | 5105465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 423966 |
End bp | 425825 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506375 |
Product | hypothetical protein |
Protein accession | YP_001190570 |
Protein GI | 146303254 |
COG category | [C] Energy production and conversion |
COG ID | [COG0348] Polyferredoxin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAAGT TAGAGTACAA GGTTACTGGA AAGGTTAGAA ATTATGAAAG GAGATTCAAT TTTTATGTGG CACTCTCAAC CGTGGGAACA GGAGCATTTA CTGGAATTGC GGTATTACTT AAGCAGGTTT TGATGATAGA GACCGGTATA TTACTTTTCA CTACGGCTCT CACAATCCTG GCCGTTAATC TTGTGCTGGA TCTAACTGTG AAGTCCCATT CCAATACCTG GGTCTTTGCT TCTCCTCCTA GAGAGATAGT GAAGAAAGCT GACAGAGTGG GAAAGGAGAT CTGTGAACAT CGCCCTTCCC TATTAGAGGG GAATAACCCA GTGTCCAGGT TGGTTTCTAA GCTGTTTAAG AAGAGCTGGG CACACTTCGC AATCATTCTT CCAAGTTTCA TAATATTCTA CGTAGTCATG GTTGTGGGTT TAGTGGGGTA TCAGAAGCTT GGTCCTGCAG GGATATCCTT GGTGAATTTT GCCTCAGACA TTAGCTGGTT GTTCTGGTTT CCCTTACTCT GGTTGTTGAC CTGGCTAGCT AACGGCAGAG CGTGGTGCCA GACCTGCCCC TTTAGCGGTC AGGCAGAGTG GGTCCACAGA TTGCATCCCT GGAAAAAGAT GAGCAAGAAG CTGGGGCTAA ACCTCAGGTG GCCCATAAAG TACAGCACCA TCTTCTATTC TGCTGTGGGC TTCTCGGTCC TAACCTGGAT GGAGGAGTTT TACGGAATTG GAGGTCCTGG AATTCCGGAA CTGACCTCAG TGGTGTTGAT ATACATTGGT GCCCTGGAGC TCTTCATTAG CCTGCTCTTC CAGGACAGGA CCTTCTGCAG GACAATTTGT CCCCTAAGTG CCCCTTTGGC CATAACCACG ACAATCTCTC CACTTGGAAC CTTCAGGGCG AAGAATCCTG AGGTATGTAA GTCCTGTACC ACTAAGGATT GCATGAAGGG GAACGATAAG TTCCACGGTT GTCCTTGGTT CGCCTCCCCA GGAAGTAAGG AGAACTCACC CATGTGCGGG CTAGCCTCGG ACTGCTACAA GGCATGCCCC CACGACAACA TTGACTGGCA GGTTAAGAGG TTCCCATGGT TGAGCGATCT CGCCGGAGGC AAGAAGAGGT TTGATATAGC CCTCTCAGTG ACTCTCCTAA CTGGGGTTGT CCTCTTTCAG TTCCTTAATG CACTGCCCTT CTACTCCATG GTGGATACCT GGCTGAGCAA AGTGACAGGA TGGGTTAATT TCGCTCAGCT ACTAGTTCCT GGACTGTCCA AGTTTGGCTA TTCGACTCAT GGCTATCCAA ACCCCCTGGA TTACTTTGCC ATCAACATGA TACCTATCCT TGTGGTCCTG GCTGCAGCCA AGTTTGAGGA AAGGAGGGGA GTACCCCTGA AGTGGGGATT CACGTCTATA TCTTATGCGC TGATACCCAT CTTCGCTGCA TCGTTACTGG TGAGAAACCT ACCCAAGTTT CTAGGCGGAT CTCCGTTGAT CCTCAACGAG ATCCTTGACC CCACTGGGGC CGGTATGCAC AACAGTGAGA TCTACTCAAC CTTCTGGGGA AGCCTGCTTC ACTCCCTGGG TCACGATCCG CTCAACGCCA CGGCTGCCTG GTGGGTTCTC CTTGTGATGG AAGCTGTAAT GGCCTTCGGC ATCTACCTAG GATTGAGGGC TTCAAACATG CTGGCTGAGA CTGACGGAGT GGGCAAGTGG ACATACTATG CCGTAGTTCT GGGGTTTGGG CTAACCTTCA TGCTAGTGAC CTACTGGATG TCTTCCCCTG CCTCTCCCAC AGCGCCCTTC TACAACCAGT ATCTCGGAAA CCTACTCTAC AACCCACTTC AGGCTACTCC GCCGTTCTGA
|
Protein sequence | MEKLEYKVTG KVRNYERRFN FYVALSTVGT GAFTGIAVLL KQVLMIETGI LLFTTALTIL AVNLVLDLTV KSHSNTWVFA SPPREIVKKA DRVGKEICEH RPSLLEGNNP VSRLVSKLFK KSWAHFAIIL PSFIIFYVVM VVGLVGYQKL GPAGISLVNF ASDISWLFWF PLLWLLTWLA NGRAWCQTCP FSGQAEWVHR LHPWKKMSKK LGLNLRWPIK YSTIFYSAVG FSVLTWMEEF YGIGGPGIPE LTSVVLIYIG ALELFISLLF QDRTFCRTIC PLSAPLAITT TISPLGTFRA KNPEVCKSCT TKDCMKGNDK FHGCPWFASP GSKENSPMCG LASDCYKACP HDNIDWQVKR FPWLSDLAGG KKRFDIALSV TLLTGVVLFQ FLNALPFYSM VDTWLSKVTG WVNFAQLLVP GLSKFGYSTH GYPNPLDYFA INMIPILVVL AAAKFEERRG VPLKWGFTSI SYALIPIFAA SLLVRNLPKF LGGSPLILNE ILDPTGAGMH NSEIYSTFWG SLLHSLGHDP LNATAAWWVL LVMEAVMAFG IYLGLRASNM LAETDGVGKW TYYAVVLGFG LTFMLVTYWM SSPASPTAPF YNQYLGNLLY NPLQATPPF
|
| |