Gene Information |
Locus tag | Msed_0989 |
Symbol | |
ID | 5104538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 911864 |
End bp | 913615 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506888 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_001191081 |
Protein GI | 146303765 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
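The coordinate and length fields above are mutually consistent, which a minimal sketch can confirm. It assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Msed_0989.
length_bp = gene_length(911864, 913615)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1752 583
```

The strand field ("-") does not affect either calculation; it only determines which strand of the replicon is read.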
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.422576 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0726488 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTCTGCC TCAACAACGA ATTCACAGGG GCCCTTATAA CTGGAACTGA GGTCGTATGG TTGACCTTTC CCAGATACGA TTCCTCCCCT GTCTTCGCGA AGATTCTCGA CGAGAAGGCT GGTTCCTTCG GTATATCAGG GGAAGTGGCA ACTCAGGAGT ACCTAGTTCC TAACATATTG AAAACTGTTC TGAGGGACGG AACAGAGGTG ATTGACCTCC TCCTAAGGGG AGAGCACTCT CTAGTTAGGA AAATCAATGC AAGGACTCCC CTGGAGATCT GGGCTGATGC AACGTTCAAC TACGGGAAAG TAAGGGCTAA GGTTTACAGG TTAAGTAAGG GAATTTACAA GTTAGCTAAC CCTGAGAACT CGGAATTCCT GGAACTACAT CTCATCTTTC CCTCAATTCA AGAAACTAGC AGGGGGTGGG TGGTATCAGG AGAAGGATAT GCCTTCCTTG GTCATTTCAG TGATGAGAGG TTCGGCATAT TTGGGAAGGA GCTCAAATTC GATGTGGAGA CTGGGGTAGA GAGGACCATA AATTACTGGA GAAACCTGAT TAGGAGGGGG AAGGGAAGGG GTAGGATCTC TCGTATGGAG ATACCGGGAT TTAAGGGAGA GGACCTTCTA ACGGCCTATG AAACGTCTGT AGGTATGCTT TTGGGATTAA TGTATAACCC CACGGGGGCA ATAGTTGCTG CACCCACAAC TTCGCTTCCT GAGATAGAGG GCGGTGTGAG GAACTGGGAC TACCGCTTCG CCTGGGTTAG GGACTCCTCG ATTGTGGCTG AGGGTCTCAT CTCTGCTGGG CACACAATGG ACGCCAGGAG AATCATAGAG TTCCTATCTA GGATGGTGTC GTTCACGACG AAGCCGTTCC TCTACCCACT CTATTCCATA GACGGTTCGG TTCCCCCAAG GGAGGTGGAG ATCCCCTGGC TCTCTGGTTT CATGAACTCC AGGCCCGTGA GGGTCGGAAA CGCGGCGGCA GCTCAGCTTC AGCTAGATCT AGAGGGATTT TTCATGGATG CACTTTACAA GTACTACGTG GCCACGGGGG ACTCCTCCTA CGTGAGGGGA CATCTGGACG TAATAGAGTA CATTGCTGAT TGGGTATCTG AGAACTGGAA GCTTCAGGAC GTAGGAATAT GGGAGGAGAG GGGAGTTCAG GCGCACTATA CCCACTCAAA GGTTATGATG TGGGTAGCTC TGGAGAGGGC AGGAAAGCTA GTGAAGGTGG TGGATAAGGA GAACAGATGG AAAGATACTA GACATGAGAT CAGGGAGTGG ATAACGGAGA ACTGCGTAAA TGATGGGAAG TTCGTAAAGA GGCCTGGAAG CAATGAGGTC GACTCCGCAT TACTTACCCT ACCGCTTTAC GGATTTGTTG AACCAGACGA TCCAACCTTT CTGAACACCT TAAGGGAGAT AGAGAACACC CTGGTAGTTG ACGGCCAGGC CAAAAGGTAT AGGAGGGACT TTCTGGGGGA GGCAAAGTAC CCCTTCACGC TGGCTAGCCT TTGGTTAGCT AGGGTTTACA TAAAGCTGGT GAGGATTGAG GACGCTGAGA GGATCATATT GGGTATCCTA GAGGCCACTC GCGGTACATA CCTCGTGGGA GAGCACATAG ATCCTAAGAG GAAAGTGTTC ACGGGGAATT TCCCGCAGGC CTTTGCCCAA TCTAACTTGA TACTGGCACT CAATGAACTT GCTGAAGCCA AGTCAGTTGC TCCTGACGAG GAAGGTCAAT GA
|
Protein sequence | MFCLNNEFTG ALITGTEVVW LTFPRYDSSP VFAKILDEKA GSFGISGEVA TQEYLVPNIL KTVLRDGTEV IDLLLRGEHS LVRKINARTP LEIWADATFN YGKVRAKVYR LSKGIYKLAN PENSEFLELH LIFPSIQETS RGWVVSGEGY AFLGHFSDER FGIFGKELKF DVETGVERTI NYWRNLIRRG KGRGRISRME IPGFKGEDLL TAYETSVGML LGLMYNPTGA IVAAPTTSLP EIEGGVRNWD YRFAWVRDSS IVAEGLISAG HTMDARRIIE FLSRMVSFTT KPFLYPLYSI DGSVPPREVE IPWLSGFMNS RPVRVGNAAA AQLQLDLEGF FMDALYKYYV ATGDSSYVRG HLDVIEYIAD WVSENWKLQD VGIWEERGVQ AHYTHSKVMM WVALERAGKL VKVVDKENRW KDTRHEIREW ITENCVNDGK FVKRPGSNEV DSALLTLPLY GFVEPDDPTF LNTLREIENT LVVDGQAKRY RRDFLGEAKY PFTLASLWLA RVYIKLVRIE DAERIILGIL EATRGTYLVG EHIDPKRKVF TGNFPQAFAQ SNLILALNEL AEAKSVAPDE EGQ
|
| |
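The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting and to check the start of the gene sequence against the start of the protein sequence. It is an illustration, not the database's own method. The codon table is the NCBI layout for translation table 11, which uses the same amino acid assignments as the standard code, and TTG is treated as an initiator (read as M) only at the first codon. The helper names are hypothetical, and only the first 30 bases of the record are used here to keep the example short.

```python
BASES = "TCAG"
# Amino acids for table 11 in NCBI order (first, second, third base cycle through TCAG).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; an initiator in first position is read as M."""
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = clean("TTGTTCTGCC TCAACAACGA ATTCACAGGG")
print(len(prefix), translate(prefix))  # 30 MFCLNNEFTG
```

The translated prefix matches the first block of the protein sequence (MFCLNNEFTG). Passing the full gene sequence to the same functions would give the GC fraction for comparison with the GC content field.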