Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1421 |
Symbol | |
ID | 5104792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1388990 |
End bp | 1390825 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507310 |
Product | glycogen debranching enzyme, putative |
Protein accession | YP_001191503 |
Protein GI | 146304187 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3408] Glycogen debranching enzyme |
TIGRFAM ID | [TIGR01561] glycogen debranching enzyme, archaeal type, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.569288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0701501 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACC CGAAGGAATG CGAGGACAGG GAGTGGATAA TACCTACTGG GACCGGAGGT TATTCGTCCT CAACCTTCTG CGGAATAAAC TCCAGAACTT ATCACGGCCT GCTAGTTATA CCGCAGGATC CACCTCACAG GAGATACATG ACCCTGGCCA AGGTGGAGGA TTTCGTTATA ACTGACGGCC AAGAGTACCC CATGAGCACG AACCATTACC TGAACGACGT GTTTTATCCG GAGGGGTACA GGTTCCTGAA TCACGTGGAG CGGGGAGAGA ACTTTGTTAG ATGGGACTTC CTTTTTGGGA ATTCAAGGGT CGAGAGAACC CTGGTTGTGC ACAGGGGTTA CAACGCCATA ACCCTGTCCT ACGCTTCCCA GAGGGGAGTT TTCAGGATAT GTCCCCTAGT CACGTACAGG AGTCATCATG TGGCTCTGAA GTCGGTTCAC CCCATCTTCA CGTACAGGCT TCTTCAGGAC CACATTCTCC TTCTCGCGAA TGGGATACCC TTCCTCAGGG TCAGGATAAG GGGAGACCAC GTCCTCGATA AGACGGAGTA CTGGTACTAT AACTTCTTTT ACCGTTTAGA CTTCGAGAGG GGAACCAATT ACCTGGAGGA CCTGTACAAT CCCTTCTGCG TGATCAGCAA GGGGAACAAG ATTGAGATGG ACTTCTACTG GGGGGAATTT GAGCCCGAGC AGAAAAGGGT TGGGTCCAAA GAGATCATGG ACCTTCTTTC AAGTGCGGGG AAAAGCTTCG TCGTGAGAAG CGGAGACAAG TACGCGATCA TTGCAGGATA TCACTGGTTT GATGAGTGGG GAAGGGATAC CATGATCTCC ATGGAGGGGA TCCTGCTCAT GAACGGGTTG TATGAACAGG CCAAGAGCAT CCTCTTGAGG TATTTCAATG CAGTTAACAG GGGCCTGATG CCCAATAACT TCCTGGGAAA CAACGAGACC GCCTACAAGG GAGTCGACGT TTCGCTATGG GGAATCAACG CCGTGTACAA GTACTATCAG TACACGAATG ACGTTGAGTT CCTGAAAAGG ATATTCCCAA GAATGCTGGA GGTCGTGGAC TCTTACTGGA AAGGAAACGG AGTTGTGGTG AACAAGGACA ACCTCTTGTA TCACGTTGGA GCACCTAGGA CTTGGATGGA CGCTCAGTTT GACGGTGAGG TCGTGACTCC AAGGGAAGGA GCAGCTGTCG AGATCAATGC CCTATGGTAC AACGCGTTAA TGATCATGGA CCAGATCTCC AAGAGGTTGG GAATACATGA CGACGAGTTC GTAGAGAAGG CCGAAAAGGT GAGGTCGGCG TTCCTGGAGA AGTTTCCTTC GGAGGCTGGG CTATATGACT ACATTGGATG GGACGATAAG CCGGGGAAGG AGATTAGACC CAATCAGCTG GTTGCTCTTG GCCTTCCTTA CCCTGTGGTC TCCAAGGATA TCGCCATGAG GGTACTGGAG GTGGTGGAGA CGGAACTGTT GAGGCCATAC GGGTTGAGCA CCCTCTCCAA GCGGGATAAA GGTTACACAC CCTTTTACAG GGGCGATAGG GCCAGTAGAG ACAGAGCGTA TCATAACGGC CCGATATGGC CATGGCTCGT GGGAATCTAC GTTGATGCTA AGCTCAACTT TGAATACGAT TCCCTCAGAA TCAAGAACCT GCTGAACCAA TTCAGTCCCC TTCTAGGAGT GGCCGTGAGG GAAAATGGAT ACGTTCCTGA GCTCTTTGAG GATATTCCTC CCTACAAGAA GGGCGGATGT ATTGCTCAAG CTTGGAGTGT CGCAGAATTG AACAGGGCAA TTAGAAATAT CATCAATTAC TCGTGA
|
Protein sequence | MLDPKECEDR EWIIPTGTGG YSSSTFCGIN SRTYHGLLVI PQDPPHRRYM TLAKVEDFVI TDGQEYPMST NHYLNDVFYP EGYRFLNHVE RGENFVRWDF LFGNSRVERT LVVHRGYNAI TLSYASQRGV FRICPLVTYR SHHVALKSVH PIFTYRLLQD HILLLANGIP FLRVRIRGDH VLDKTEYWYY NFFYRLDFER GTNYLEDLYN PFCVISKGNK IEMDFYWGEF EPEQKRVGSK EIMDLLSSAG KSFVVRSGDK YAIIAGYHWF DEWGRDTMIS MEGILLMNGL YEQAKSILLR YFNAVNRGLM PNNFLGNNET AYKGVDVSLW GINAVYKYYQ YTNDVEFLKR IFPRMLEVVD SYWKGNGVVV NKDNLLYHVG APRTWMDAQF DGEVVTPREG AAVEINALWY NALMIMDQIS KRLGIHDDEF VEKAEKVRSA FLEKFPSEAG LYDYIGWDDK PGKEIRPNQL VALGLPYPVV SKDIAMRVLE VVETELLRPY GLSTLSKRDK GYTPFYRGDR ASRDRAYHNG PIWPWLVGIY VDAKLNFEYD SLRIKNLLNQ FSPLLGVAVR ENGYVPELFE DIPPYKKGGC IAQAWSVAEL NRAIRNIINY S
|
| |