Gene Information |
Locus tag | Msed_1134 |
Symbol | |
ID | 5104155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1066056 |
End bp | 1070423 |
Gene Length | 4368 bp |
Protein Length | 1455 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507026 |
Product | hypothetical protein |
Protein accession | YP_001191219 |
Protein GI | 146303903 |
COG category | |
COG ID | |
TIGRFAM ID | |
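
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Msed_1134 record above.
length = gene_length(1066056, 1070423)
print(length)                  # 4368
print(protein_length(length))  # 1455
```

Both results match the listed Gene Length (4368 bp) and Protein Length (1455 aa).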
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000377058 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTAT CCCTTGAGAA TTTCATTGGT TCTGCTGTGA GTAATATTCT AGACGGTGTT CATGATCTCA AGAGTTTCAT ACTTGTATTG ACACCTCCAG GTTCCTCCCG GGGAGAGATT GTTAGGAGGT TGGCTGGTAA GCTCGAGGAT GTCGAGTTCT TAGTTTATGA GGAGCTTTAT AAGAGGCTTA ACGACCAAGA CTTAAGGTCA AGGTTTAAAC CACTTAGAGG CTTATCAGTG GAGGGTAAGG ACATTCTTGA GTATATCAAG GGTGATTACA GTAACATAAA GGAGGCATTA AGCACTAGAC GCGTTGCCAT AGTGCCCAGG AGTACCATGG ATGCCGTGGT GCTGGTTCGC AGGCTTAGGG AAGAGCTTGG TAGGGATTGG GGCTCTGCCA AGAAGCATGT AGTCATTAAC CACCTCTCCA GCATCTTTAA GAGGGAGCTT GAGGAGTCCC GCTCACAATT CACTCTCATT GAGACTCGCC ACGAGTTGCT CAGGAGGGAC GACGTGGTTA AGGAACTGGT CAGGGAGAGC GAGGGGATAA GCCTAGCCCT TTGGAATAGG GAGGGCAAGC TGGGGGAGAA GCTGGAGAGC GGTGTCAAGG CGATCAGGGC TCTGAGCCCG GGTAGCGTTG GATTTGACGA GTTAGCCAGG GATTTCGCCG GCGAGTTAAA GGTGATAATG CCGACCGTAG TACTAACCTC AGTAGCTTAC GTGGCCGGCG CGATGCTCGT GGCGCCGGTA ATACAGGCCC TGGTCTTGGC CGACGGGTTG ACCGCAATCT CCTTGGCCAT TGGACGATTC CTCGAGGAGT TCACCAAGAG GGTTGGGATT TATGCAGTGA AGGAACCGAT CAAGAAGTTC TCCGAGAGGT TCCTCGGCTG GCTCACCAAG AAGGGTGAGG CTAGGAACAA GTTCCTTAAG TCCATGGGCC AGCTCCTGAA GGCCGTAATA GTAGCACAAG AGTTCGTGGT CGATGATAGG TTTGAGGGGA TTGTGGATGA GGTGGCCAGC GAGTGGGGGC TTGACGTGGA CACGTTCAGG AACTTCATAG ACAACACGTA CAAGGTGGCC ACGCAGAGAG TGGCTACCAA GGAGGATGTG GACAAGATAC AGGAGGAGAT GAACAAGATA TTGGAGGAAG TGGACAAGAG ATTAACCGAA CTTGAGAAAA AAGTTGAGGA GTTAACTCGT GATATCGAGA AGATTGAGGG GGAGTTGGAG ACGACGCAGG TCTTGGAGGG TGTTTATCGC GATCCCACTA TACTGGGAAT TAACCTGGAG AGGAAGACGT TGAGGGTTGA GGACGTGGAG TATGCCTTGG TTGTGGACAA GAACTTTGAG AATTACTCGA GAGAGATTAT GGACAGGATT GCGAGGGGCG ACTTCGTAAT ACTCACTGGG TCTAAGGGCA TTGGTAAGTC AGTCCTTATC AAGTATTCGT TAGCTGAGTT ACTGAAGAGC AAGGTTAGCC CTTATGTGGT TTATGAGGCG AAGGAGCTGA CCGTTGTCCA CAATGTTAAA AGGCTTCACT ACATTGCGGA CAACGCCTTT GGCGAGGTGC CCATAATCTA CTATGACCCA TCTGTCCCGG AGCACTACGA ACCGGGACAA GACCTGACGC CAGAGACAAA GGTTGCGAAT ACTGTGAAAC AGTTGATGAG GTTCTACCAG AGGCGGTCTG AGTCAGGGAA GCAAGTGCCC GTGATGCTTG TGCTACCAAG TGACGTACTC CAACTAGTCA TAGACAACCT TCTCATGAAC AATAAGGGTT TGGAGACTTA CGAGGTTGAG 
TTCGTGAAGG AGCGTGTTGT GAACATTGAC CTACGCACCT CGCAGTTCTT GAGTGGGGTA ATCAGATCTT ATAGTGGAGG GATTTGTGCT GAAGAGGTTT ATGAGAGGTT GGGTAAGGAG GTTTATGAGA GATATAGGGA TGGATATACA CTAGTGGCTA GATACGTTGG GGAATGGCTC AAGGCCAACA AGTGCAGTGT CGAGGACGTT GAGCAAGCGA TAGGGGCAGG TGCTGGCGTC GCAAAGGCAT TCATATCCCA GTACATATAC AAGGCAATAC TAGGAGGCGA CGACAACATC GAAAGCTTCA AGGAGAAGTT CAGAAAGTAC GCACTCGCCT TTGCAGCGAG GGCCTTTCTA GGCCGAATGC CGCCAAAGTG GCTGATAGGC AGAGTACCAG CAGTCATTAA TAATAAGGGC AAATCGGAAA TTGCCTATGA CGTGCCCATA CACTTGGAAT TCCCAGACCA TGTTGGTAGG TGGCTCTCAA TGAAGCACGA GGACCTAATC GAAGAAACCA TAGAGGATAT AATCGACGGA AAACTATCGA AGGAACTGAA GAAGTACGAA GGTGTAGGTA GGCTCGGAAC AGTAGTGAAT GGAGCTGAGG GAATGGAGAA GGTCATCAAA TGGACTCTCG ATAATTTGAT GAAGATAAAA GGCTCCATTA ACGTAAAAGT GGCAGTGATA GACGTATTCT TTACCGCAGC GGCAAATAGG TTGTTAAGTG AGGAGGAGCA GGTATTAAGG AGTCATCCGC ACGAATTGGC CATGGTGGTG GGACTAGCCC ACACACCATA CCCAATTCCA ATTGAGCTCC TTCTAGAAAC TGATGAGGGT GAGTTGGTTA AGGAGTTGAG AGACTGGATA CTTGTTGGGA ATGAGATGCC ACCTTCTGTA AGGGTGCTCC TGCATGAGAA GGCCTCTTTG TTCAGGGACT TCATTGATGC TTGTGGGATG GTTAATGAAT TGTATAGAGA AGTCAAGGAG AGTGGCAGAG TGAAATTAAC AAGGGTATTT GAGGCATTGG GTCTGCTGGT AATAACAGGT GGAGAATTTG GAAAGGAGTG CCTGGACAAG GCAAACAGAA TCATAGGTAA ATATATTGAG TACGTACAAC TCGCGATTAC TCCTATACAT GGACTTAAGG AGTTAGCGAG AAGGTTAATT GAAAATGGTG AGTATGATGA GATGGTACGT CTCCTGCATA AAATTACGTT GCAGGATCAA AATACTGCCG AAGAGATTCA TTCCGAAGTT GTTTTGCCTA ATCATGATGT GTTCCATGCC AGGTTGAGTT GGGCTGGCAA ATTAATTTAC ATGATGACTC TTTCGAGATT GTCGAAGGTG GATGATTGGA CTAGACTTGC ATCCGAGACC GTTAATGATA TTTGTAGGGA TGAGAGTTTG CATGGTGTAC GTTGCCTATA CGCAAAGGCT AGATCATATC CACTTATCGC TGTCCACAAG TCAATGATAG GTAAACTCGA TGAGGCTAAT AAGTATGTTG ATGAGGCAGT GAAGGCGATG GAGGAATTAA GTAAGAGAAG CAGGGAAGAG TTAGTTAGGG GGCTTGAGAA CGCATTAAGG TTAAGGAGCT CATTTATGGA CATAAGCGAG ACTATTGATA AAGAATTAAA GAGTGTAAGA GCTCACCTTT ACCACCACAC AGCAATAGTG AAGATGAATG TTGACTTAGG GGTTGCATTG AATTATGCCA AAGAGGTGTG CGACATTTCC AGGGAGATAG GGCTTATGAA TGATATAGCT AGCACTTGTG GTTTAAGGGC TCGTTTAAAC TTCTTATCAG 
GTAACGTCGA GAATGGCGTA AAGGAGTTCG GGGAGCTTTG GAAAGAAGCA TTAGACAAAA CATTTACAGA AATGGATAGA GGTGGTGTCT CTTCCACTCT TGCTCACTAC TTGGTTTCAC TCTTGGCGGT GAATGACCTA AGCAGGGTGG AGGAGGAGTA CGGGACATTT CGTCCATTCC TAGTTAATAA GCCGATTTAT TGGGCTATCG TGGCGGGGTT GATGAGGGTC TATGGACTGC GTGTTGAGGA CTTTTACGAG GCAAGGAGAG ATGCCATAAT GGAGGAAGCA CCTACACCAT TCAGGAAGGT TTTGCTTTGC CTCGAAGTAG ATGGTGAATG TGAAACAATG TGTCAATCGA TAGAAGATCT CGATGATCGC ATATTATGTT GTGAACTGAT CCAAGCCAAG GATAATCCCA AGTTAATGAA AAGCCTCATA ATTTCCACAC CTAAGGAGAA GTATAGGGAC ATTTTAAGTC AGTTGTTAAA GGACATTGAC GATCCCTACG AGATAACTGA GATCGGTGTA TCACCTTCTG GGTCTTTTTT GTCCTTTGTC GAGGTTCTTA GATTCATTAT CGAGGGCAAG CTCAACAAGG CGAGACTAAT CGCCGAGTTT TGGCGTAATG ATTCACGGTC AAGGGGTAAG CCTGTGCCTG AGGGTTTATG GAATGAGTTA GGCAAATCCC TTAGTGGAGA CAGATGCAGT AATGACTGCA GGATGGCGCT CATTAAACTC TTCTACCTCC AAGCTTAG
|
Protein sequence | MSLSLENFIG SAVSNILDGV HDLKSFILVL TPPGSSRGEI VRRLAGKLED VEFLVYEELY KRLNDQDLRS RFKPLRGLSV EGKDILEYIK GDYSNIKEAL STRRVAIVPR STMDAVVLVR RLREELGRDW GSAKKHVVIN HLSSIFKREL EESRSQFTLI ETRHELLRRD DVVKELVRES EGISLALWNR EGKLGEKLES GVKAIRALSP GSVGFDELAR DFAGELKVIM PTVVLTSVAY VAGAMLVAPV IQALVLADGL TAISLAIGRF LEEFTKRVGI YAVKEPIKKF SERFLGWLTK KGEARNKFLK SMGQLLKAVI VAQEFVVDDR FEGIVDEVAS EWGLDVDTFR NFIDNTYKVA TQRVATKEDV DKIQEEMNKI LEEVDKRLTE LEKKVEELTR DIEKIEGELE TTQVLEGVYR DPTILGINLE RKTLRVEDVE YALVVDKNFE NYSREIMDRI ARGDFVILTG SKGIGKSVLI KYSLAELLKS KVSPYVVYEA KELTVVHNVK RLHYIADNAF GEVPIIYYDP SVPEHYEPGQ DLTPETKVAN TVKQLMRFYQ RRSESGKQVP VMLVLPSDVL QLVIDNLLMN NKGLETYEVE FVKERVVNID LRTSQFLSGV IRSYSGGICA EEVYERLGKE VYERYRDGYT LVARYVGEWL KANKCSVEDV EQAIGAGAGV AKAFISQYIY KAILGGDDNI ESFKEKFRKY ALAFAARAFL GRMPPKWLIG RVPAVINNKG KSEIAYDVPI HLEFPDHVGR WLSMKHEDLI EETIEDIIDG KLSKELKKYE GVGRLGTVVN GAEGMEKVIK WTLDNLMKIK GSINVKVAVI DVFFTAAANR LLSEEEQVLR SHPHELAMVV GLAHTPYPIP IELLLETDEG ELVKELRDWI LVGNEMPPSV RVLLHEKASL FRDFIDACGM VNELYREVKE SGRVKLTRVF EALGLLVITG GEFGKECLDK ANRIIGKYIE YVQLAITPIH GLKELARRLI ENGEYDEMVR LLHKITLQDQ NTAEEIHSEV VLPNHDVFHA RLSWAGKLIY MMTLSRLSKV DDWTRLASET VNDICRDESL HGVRCLYAKA RSYPLIAVHK SMIGKLDEAN KYVDEAVKAM EELSKRSREE LVRGLENALR LRSSFMDISE TIDKELKSVR AHLYHHTAIV KMNVDLGVAL NYAKEVCDIS REIGLMNDIA STCGLRARLN FLSGNVENGV KEFGELWKEA LDKTFTEMDR GGVSSTLAHY LVSLLAVNDL SRVEEEYGTF RPFLVNKPIY WAIVAGLMRV YGLRVEDFYE ARRDAIMEEA PTPFRKVLLC LEVDGECETM CQSIEDLDDR ILCCELIQAK DNPKLMKSLI ISTPKEKYRD ILSQLLKDID DPYEITEIGV SPSGSFLSFV EVLRFIIEGK LNKARLIAEF WRNDSRSRGK PVPEGLWNEL GKSLSGDRCS NDCRMALIKL FYLQA