Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1322 |
Symbol | |
ID | 5104573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1299707 |
End bp | 1301404 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507211 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_001191404 |
Protein GI | 146304088 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.033472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTAGGCT TCATTTCCAA TCAAATAACA TCCGCTATCA TTGACGGCAC ATCGGTTGTC TGGTTTCCAG TTCCCAAGTT CGACTCTCCC TCAATCTTCT CCAAGTTGGT TGATGAAAGG GGAGGGGAAT TCTCCGTGGT GCCGGGAAAG GTAACTTACA TGGCTCAGGA ATACAGGGAC CCAATGGTTC TCACAACTTA CGTGGAAACA GACCAGGGAA AGATGGTTAT CCAAGACCTT ATTCCCATAG GGGAGACCAT TATTATAAGG AGGGTGGAGA GCGAGTTTCC CTTCAGGGTT GTGTTCAATC CAATATTCCA TTATGGTCTC TACAGACCAG TTCCCGATGG CAATAGGAGA ATCAATCCAA GGGGGAGGGA CTGTGTGGCG TTTCTTTATG AGTATGATGG AGAAGTGGAG ATGGAGAGCG ACGATGTCTG GAAATTCTCA AGCGGAAAGG GATATCTCGT AGCTAATTAC TCATCAGATG CCAAGCACGG TCCATTGAGC GAGAGGACCT CCCACCTATC CCTTGACTTC TCCAGACCCT TTGAGAAGAC CGTTGAGTAC TGGAGAAGGT CCCTACCCAA GTCCAGGATA TACCTAGAGG ATTTATACAC CACTTCACTT GCAGTTCTCC TGGGATCCAT CTACGCCCCT TCGGGAGGTC CAGTGGCTTC CCCCACAACC TCCCTACCAG AGGTGATAGG TGGGTCCAGG AACTGGGACT ACCGTTTTGC GTGGGTAAGG GACTCCTCTA TTATCGCCGA GTCCCTCCTA GACGCGGATT ACGTGGTGAA GGCCAGGGAC ATAATTAACT TCTTGCTCTC CCTGATTAAC TTCTCGTCGA AACCTTTCTT CTACCCCCTC TACACGGTGG AGGGAACTAT CCCACCTCCA GAGAGGAAAC TTCCCTGGCT CTCAGGTTTC AGGGGATCTA GACCGGTTAC GGTGGGAAAC GGGGCGTCAA CTCAGGTTCA GCTCGACGTC GAGGGATTTT TCATGGCAAC GCTCTACAAG TACTTTGAGA AGACGGGAGA TAGGGTATAC ATTTACGATG CCCTCGAGAA GATTTTCTAT CTCGCTGACT GGGAGGCAGA GAACTGGAGA ATGAAGGACT CAGGGATATG GGAAGATAGG GGAGAGCCTC AGCACTACAT TCACTCTAAG GTGATGATGT GGGTTGCCAT GGACAGGGCT GGGAAGATCG CGAGTACCCT AGGCATGCAG GATCGATGGA AGGACGCTAG GGAGGAGCTG AGGTCCTGGA TTCTTGAACA GTCAGGAGAG TACTTTCCTA GATATCCAGG AAGTGACCAG GTTGACGCCT CGATCCTCTC GGCACCCCTT TACGATTTCG TTGACGTTAA CGATAAGGTA TTTCTAAATA CCCTACGCAG GGTAGAGAGG GATCTGGTTA AGGACGGATT CGTCAAGAGA TATGTTTCGG ACTTCATGGG AGAGGCGAAA CATCCCTTCC TCCTCACCAC GCTTTGGCTG GCAAGGATTT ACATAAGGTT AGGCGAAACA GGGAAGGCCA GGGATCTCCT GGAGAGGTTA GACAGGGTCT CCGGCAGCCT TCACCTCCTA GGAGAGCATC TGGACACCTC CACCCTAGAG TTCACCGGTA ACTTCCCCCA GGTTTTCGTT CACGCACAGG TTGTTTCGGC ACTCAAGGAA CTGGAACGAT TTCAGTGA
|
Protein sequence | MLGFISNQIT SAIIDGTSVV WFPVPKFDSP SIFSKLVDER GGEFSVVPGK VTYMAQEYRD PMVLTTYVET DQGKMVIQDL IPIGETIIIR RVESEFPFRV VFNPIFHYGL YRPVPDGNRR INPRGRDCVA FLYEYDGEVE MESDDVWKFS SGKGYLVANY SSDAKHGPLS ERTSHLSLDF SRPFEKTVEY WRRSLPKSRI YLEDLYTTSL AVLLGSIYAP SGGPVASPTT SLPEVIGGSR NWDYRFAWVR DSSIIAESLL DADYVVKARD IINFLLSLIN FSSKPFFYPL YTVEGTIPPP ERKLPWLSGF RGSRPVTVGN GASTQVQLDV EGFFMATLYK YFEKTGDRVY IYDALEKIFY LADWEAENWR MKDSGIWEDR GEPQHYIHSK VMMWVAMDRA GKIASTLGMQ DRWKDAREEL RSWILEQSGE YFPRYPGSDQ VDASILSAPL YDFVDVNDKV FLNTLRRVER DLVKDGFVKR YVSDFMGEAK HPFLLTTLWL ARIYIRLGET GKARDLLERL DRVSGSLHLL GEHLDTSTLE FTGNFPQVFV HAQVVSALKE LERFQ
|
| |