Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0911 |
Symbol | |
ID | 5103557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 842213 |
End bp | 844198 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506814 |
Product | Alpha-glucosidase |
Protein accession | YP_001191007 |
Protein GI | 146303691 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.22141 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAAAGG CTTTTGAGAA GGAAGGAACT TACCAGTTCC TTATAAACGA TCCCTTCCCT CCAGTTGATT TCCAATTTCA GGGAAAGCTT AGCTACAAAT CCCTGCAGGA TTTCGATCTC GAGCTCGAGG AGGAGGAAGG GCTAACCCTC ATCAAACCCC TAGGCATCAA GGATCACGTT TTAGGACACG GAGAGAAGGC CTTTGAACTG GATAGAAAAC GAGGTAAATA CGTGATGTAT AACGTCGATG CTGGGGCATA TCACAAGTAC TCCGACCCCA TGTACGTGAA CATTCCCTTC ATGATAGTGG TGAGAGGAGG AGTTGCCACA GGCTATTTCG TCAACTCGGC CTCCAGGTTA GTCTTTGACG TGGGAAGGGA TCACTACGAC GAAATAAGGA TAACCATACC CGAGGATTAC GTGGAACTCT ACGTCTTTGA GGGACCTAGA ATAGAACAGG TGATCGAGAG ATATGTATCC CTCACGGGGT TACCCTTCCT TCCCCCTGAG TGGGCCCTTG GTTACATGAT ATCGAGGTAC TCATATTACC CTGACACCCA CATCCTAGAA CTTCTTGACC TACTCAGGAA GGATGGGTTT CCCGTGTCGG CAATCTTCCT GGACATCGAT TTCATGGATC AATTCAAGTT GTTTACCTGG CACCCCAAGA GGTTCCCTGA CCCGAAGAAA TTCCTTGAGG AAGTTCACTC GAGAGGGGTC AAGGTAATCA CGATCGTGGA CCACAGCGTG AGGGCAGACC AGAACTACGA GGTATTCAAG TTGGGACTCG GTAAGTACTG TGAGACGGAG AACGGAGACC TGTTCGTGGG GAAACTGTGG CCAGGTAACT GTGTGTACCC TGACTTCTTC CGAGAGGATG CCCGAGGGTG GTGGGCAGAA CTAGTAAGGG AGTGGGTGGA GCAGGGGATA GACGGAATCT GGTTGGACAT GAACGAGCCA ACGGACTTCA CGGAGCTTTT CAGGTTGAGG CAAGCCTGTA GGGACTTCCA GGTTAGGGAG ACTCCTTTCT CCTACGTCTT TCCCGAGAAC GTGGTACACG TCCTGAAAGG GAGGAAAGTC AAACACGGGA AGGTCAGGAA CGCGTATCCC TATTATGAGG CCATGGCCAC CTTTGATGGG GTGTCGAGGG CGAGAAGGGA GATGTTTATC TTAAGTAGGT CGGGCTACGC AGGGATCCAA AGATACGCGG GAATCTGGAC AGGGGATAAT ACCGCCTCCT GGAATCAGCT GAAGTTACAG CTACAGCTAG TTTTAGGGCT TTCGATGTCC GGTGTACCTT ACGTCGGAAT GGACATAGGA GGATTTCAGG GTAGAGAATT TCCAGAGATC GAGAACTCGC CTGAGATGCT CGTTAGGCAC TTTCAGTTGG CGATGTTCTT TCCCTTCTTC AGGACGCATA AGAGCAAGGA TGGAGTAGAT AGTGAACCTG TGTTCCTACC TAGCATGTAC AAGGAGAAGG TGAAGAGGGT CATGGAGACC AGAAAAATGT TCCTGCCCTA CCTTTACGCA CTGGCTGAGG AGGCGCATAG AACTGGACAT CCCATAATCA GGCCTCTCTT TTACGAGTAT CAGGAAGACG AGGACACCTA CAGGATCGAT GACGAGTATC TAGTGGGGAA ATTCCTATTG TATGCCCCTC TCATGGGGAG AGAGGATAGC AGGGACGTGT ACCTTCCTGA AAAGTGGGCA GACTTCTGGA CTGGGGAAGT AATGCAGGGA TGGGTAAGAT CCAAAGATGA GTTGCCAATC TACGTTAGGG AGGGAGCCAT AATTCCCCTG TCAGATCATG GACTGTTAGT GTATGGAAAC GGGGAGTACG AGTATTGGGG AACCAAGATA GTTTCCACAG ACAACATAGT TACATTCTCA CCTCCAGTCT ACATCAAGTC GTTGATACGT ATTGATGAAA ACGGTAGAAG AATGATTAGT GTTAATTCCG AGGTTACAGC AATAAGGATT AAGTAA
|
Protein sequence | MIKAFEKEGT YQFLINDPFP PVDFQFQGKL SYKSLQDFDL ELEEEEGLTL IKPLGIKDHV LGHGEKAFEL DRKRGKYVMY NVDAGAYHKY SDPMYVNIPF MIVVRGGVAT GYFVNSASRL VFDVGRDHYD EIRITIPEDY VELYVFEGPR IEQVIERYVS LTGLPFLPPE WALGYMISRY SYYPDTHILE LLDLLRKDGF PVSAIFLDID FMDQFKLFTW HPKRFPDPKK FLEEVHSRGV KVITIVDHSV RADQNYEVFK LGLGKYCETE NGDLFVGKLW PGNCVYPDFF REDARGWWAE LVREWVEQGI DGIWLDMNEP TDFTELFRLR QACRDFQVRE TPFSYVFPEN VVHVLKGRKV KHGKVRNAYP YYEAMATFDG VSRARREMFI LSRSGYAGIQ RYAGIWTGDN TASWNQLKLQ LQLVLGLSMS GVPYVGMDIG GFQGREFPEI ENSPEMLVRH FQLAMFFPFF RTHKSKDGVD SEPVFLPSMY KEKVKRVMET RKMFLPYLYA LAEEAHRTGH PIIRPLFYEY QEDEDTYRID DEYLVGKFLL YAPLMGREDS RDVYLPEKWA DFWTGEVMQG WVRSKDELPI YVREGAIIPL SDHGLLVYGN GEYEYWGTKI VSTDNIVTFS PPVYIKSLIR IDENGRRMIS VNSEVTAIRI K
|
| |