Gene Msed_0989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0989 
Symbol 
ID5104538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp911864 
End bp913615 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content50% 
IMG OID640506888 
Productglycoside hydrolase 15-related 
Protein accessionYP_001191081 
Protein GI146303765 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.422576 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0726488 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTCTGCC TCAACAACGA ATTCACAGGG GCCCTTATAA CTGGAACTGA GGTCGTATGG 
TTGACCTTTC CCAGATACGA TTCCTCCCCT GTCTTCGCGA AGATTCTCGA CGAGAAGGCT
GGTTCCTTCG GTATATCAGG GGAAGTGGCA ACTCAGGAGT ACCTAGTTCC TAACATATTG
AAAACTGTTC TGAGGGACGG AACAGAGGTG ATTGACCTCC TCCTAAGGGG AGAGCACTCT
CTAGTTAGGA AAATCAATGC AAGGACTCCC CTGGAGATCT GGGCTGATGC AACGTTCAAC
TACGGGAAAG TAAGGGCTAA GGTTTACAGG TTAAGTAAGG GAATTTACAA GTTAGCTAAC
CCTGAGAACT CGGAATTCCT GGAACTACAT CTCATCTTTC CCTCAATTCA AGAAACTAGC
AGGGGGTGGG TGGTATCAGG AGAAGGATAT GCCTTCCTTG GTCATTTCAG TGATGAGAGG
TTCGGCATAT TTGGGAAGGA GCTCAAATTC GATGTGGAGA CTGGGGTAGA GAGGACCATA
AATTACTGGA GAAACCTGAT TAGGAGGGGG AAGGGAAGGG GTAGGATCTC TCGTATGGAG
ATACCGGGAT TTAAGGGAGA GGACCTTCTA ACGGCCTATG AAACGTCTGT AGGTATGCTT
TTGGGATTAA TGTATAACCC CACGGGGGCA ATAGTTGCTG CACCCACAAC TTCGCTTCCT
GAGATAGAGG GCGGTGTGAG GAACTGGGAC TACCGCTTCG CCTGGGTTAG GGACTCCTCG
ATTGTGGCTG AGGGTCTCAT CTCTGCTGGG CACACAATGG ACGCCAGGAG AATCATAGAG
TTCCTATCTA GGATGGTGTC GTTCACGACG AAGCCGTTCC TCTACCCACT CTATTCCATA
GACGGTTCGG TTCCCCCAAG GGAGGTGGAG ATCCCCTGGC TCTCTGGTTT CATGAACTCC
AGGCCCGTGA GGGTCGGAAA CGCGGCGGCA GCTCAGCTTC AGCTAGATCT AGAGGGATTT
TTCATGGATG CACTTTACAA GTACTACGTG GCCACGGGGG ACTCCTCCTA CGTGAGGGGA
CATCTGGACG TAATAGAGTA CATTGCTGAT TGGGTATCTG AGAACTGGAA GCTTCAGGAC
GTAGGAATAT GGGAGGAGAG GGGAGTTCAG GCGCACTATA CCCACTCAAA GGTTATGATG
TGGGTAGCTC TGGAGAGGGC AGGAAAGCTA GTGAAGGTGG TGGATAAGGA GAACAGATGG
AAAGATACTA GACATGAGAT CAGGGAGTGG ATAACGGAGA ACTGCGTAAA TGATGGGAAG
TTCGTAAAGA GGCCTGGAAG CAATGAGGTC GACTCCGCAT TACTTACCCT ACCGCTTTAC
GGATTTGTTG AACCAGACGA TCCAACCTTT CTGAACACCT TAAGGGAGAT AGAGAACACC
CTGGTAGTTG ACGGCCAGGC CAAAAGGTAT AGGAGGGACT TTCTGGGGGA GGCAAAGTAC
CCCTTCACGC TGGCTAGCCT TTGGTTAGCT AGGGTTTACA TAAAGCTGGT GAGGATTGAG
GACGCTGAGA GGATCATATT GGGTATCCTA GAGGCCACTC GCGGTACATA CCTCGTGGGA
GAGCACATAG ATCCTAAGAG GAAAGTGTTC ACGGGGAATT TCCCGCAGGC CTTTGCCCAA
TCTAACTTGA TACTGGCACT CAATGAACTT GCTGAAGCCA AGTCAGTTGC TCCTGACGAG
GAAGGTCAAT GA
 
Protein sequence
MFCLNNEFTG ALITGTEVVW LTFPRYDSSP VFAKILDEKA GSFGISGEVA TQEYLVPNIL 
KTVLRDGTEV IDLLLRGEHS LVRKINARTP LEIWADATFN YGKVRAKVYR LSKGIYKLAN
PENSEFLELH LIFPSIQETS RGWVVSGEGY AFLGHFSDER FGIFGKELKF DVETGVERTI
NYWRNLIRRG KGRGRISRME IPGFKGEDLL TAYETSVGML LGLMYNPTGA IVAAPTTSLP
EIEGGVRNWD YRFAWVRDSS IVAEGLISAG HTMDARRIIE FLSRMVSFTT KPFLYPLYSI
DGSVPPREVE IPWLSGFMNS RPVRVGNAAA AQLQLDLEGF FMDALYKYYV ATGDSSYVRG
HLDVIEYIAD WVSENWKLQD VGIWEERGVQ AHYTHSKVMM WVALERAGKL VKVVDKENRW
KDTRHEIREW ITENCVNDGK FVKRPGSNEV DSALLTLPLY GFVEPDDPTF LNTLREIENT
LVVDGQAKRY RRDFLGEAKY PFTLASLWLA RVYIKLVRIE DAERIILGIL EATRGTYLVG
EHIDPKRKVF TGNFPQAFAQ SNLILALNEL AEAKSVAPDE EGQ