Gene Msed_1322 details


Gene Information

Locus tag: Msed_1322
Symbol:
ID: 5104573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1299707
End bp: 1301404
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 51%
IMG OID: 640507211
Product: glycoside hydrolase 15-related
Protein accession: YP_001191404
Protein GI: 146304088
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID:
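
The length fields in this record follow from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1299707, 1301404)
print(length)                  # 1698
print(protein_length(length))  # 565
```

Both values agree with the Gene Length (1698 bp) and Protein Length (565 aa) fields above.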


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.033472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTAGGCT TCATTTCCAA TCAAATAACA TCCGCTATCA TTGACGGCAC ATCGGTTGTC 
TGGTTTCCAG TTCCCAAGTT CGACTCTCCC TCAATCTTCT CCAAGTTGGT TGATGAAAGG
GGAGGGGAAT TCTCCGTGGT GCCGGGAAAG GTAACTTACA TGGCTCAGGA ATACAGGGAC
CCAATGGTTC TCACAACTTA CGTGGAAACA GACCAGGGAA AGATGGTTAT CCAAGACCTT
ATTCCCATAG GGGAGACCAT TATTATAAGG AGGGTGGAGA GCGAGTTTCC CTTCAGGGTT
GTGTTCAATC CAATATTCCA TTATGGTCTC TACAGACCAG TTCCCGATGG CAATAGGAGA
ATCAATCCAA GGGGGAGGGA CTGTGTGGCG TTTCTTTATG AGTATGATGG AGAAGTGGAG
ATGGAGAGCG ACGATGTCTG GAAATTCTCA AGCGGAAAGG GATATCTCGT AGCTAATTAC
TCATCAGATG CCAAGCACGG TCCATTGAGC GAGAGGACCT CCCACCTATC CCTTGACTTC
TCCAGACCCT TTGAGAAGAC CGTTGAGTAC TGGAGAAGGT CCCTACCCAA GTCCAGGATA
TACCTAGAGG ATTTATACAC CACTTCACTT GCAGTTCTCC TGGGATCCAT CTACGCCCCT
TCGGGAGGTC CAGTGGCTTC CCCCACAACC TCCCTACCAG AGGTGATAGG TGGGTCCAGG
AACTGGGACT ACCGTTTTGC GTGGGTAAGG GACTCCTCTA TTATCGCCGA GTCCCTCCTA
GACGCGGATT ACGTGGTGAA GGCCAGGGAC ATAATTAACT TCTTGCTCTC CCTGATTAAC
TTCTCGTCGA AACCTTTCTT CTACCCCCTC TACACGGTGG AGGGAACTAT CCCACCTCCA
GAGAGGAAAC TTCCCTGGCT CTCAGGTTTC AGGGGATCTA GACCGGTTAC GGTGGGAAAC
GGGGCGTCAA CTCAGGTTCA GCTCGACGTC GAGGGATTTT TCATGGCAAC GCTCTACAAG
TACTTTGAGA AGACGGGAGA TAGGGTATAC ATTTACGATG CCCTCGAGAA GATTTTCTAT
CTCGCTGACT GGGAGGCAGA GAACTGGAGA ATGAAGGACT CAGGGATATG GGAAGATAGG
GGAGAGCCTC AGCACTACAT TCACTCTAAG GTGATGATGT GGGTTGCCAT GGACAGGGCT
GGGAAGATCG CGAGTACCCT AGGCATGCAG GATCGATGGA AGGACGCTAG GGAGGAGCTG
AGGTCCTGGA TTCTTGAACA GTCAGGAGAG TACTTTCCTA GATATCCAGG AAGTGACCAG
GTTGACGCCT CGATCCTCTC GGCACCCCTT TACGATTTCG TTGACGTTAA CGATAAGGTA
TTTCTAAATA CCCTACGCAG GGTAGAGAGG GATCTGGTTA AGGACGGATT CGTCAAGAGA
TATGTTTCGG ACTTCATGGG AGAGGCGAAA CATCCCTTCC TCCTCACCAC GCTTTGGCTG
GCAAGGATTT ACATAAGGTT AGGCGAAACA GGGAAGGCCA GGGATCTCCT GGAGAGGTTA
GACAGGGTCT CCGGCAGCCT TCACCTCCTA GGAGAGCATC TGGACACCTC CACCCTAGAG
TTCACCGGTA ACTTCCCCCA GGTTTTCGTT CACGCACAGG TTGTTTCGGC ACTCAAGGAA
CTGGAACGAT TTCAGTGA
 
Protein sequence
MLGFISNQIT SAIIDGTSVV WFPVPKFDSP SIFSKLVDER GGEFSVVPGK VTYMAQEYRD 
PMVLTTYVET DQGKMVIQDL IPIGETIIIR RVESEFPFRV VFNPIFHYGL YRPVPDGNRR
INPRGRDCVA FLYEYDGEVE MESDDVWKFS SGKGYLVANY SSDAKHGPLS ERTSHLSLDF
SRPFEKTVEY WRRSLPKSRI YLEDLYTTSL AVLLGSIYAP SGGPVASPTT SLPEVIGGSR
NWDYRFAWVR DSSIIAESLL DADYVVKARD IINFLLSLIN FSSKPFFYPL YTVEGTIPPP
ERKLPWLSGF RGSRPVTVGN GASTQVQLDV EGFFMATLYK YFEKTGDRVY IYDALEKIFY
LADWEAENWR MKDSGIWEDR GEPQHYIHSK VMMWVAMDRA GKIASTLGMQ DRWKDAREEL
RSWILEQSGE YFPRYPGSDQ VDASILSAPL YDFVDVNDKV FLNTLRRVER DLVKDGFVKR
YVSDFMGEAK HPFLLTTLWL ARIYIRLGET GKARDLLERL DRVSGSLHLL GEHLDTSTLE
FTGNFPQVFV HAQVVSALKE LERFQ
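
Both sequences above are printed in blocks of 10 characters, 6 blocks per line. A minimal sketch of how such a listing could be joined into one continuous string, and how a GC fraction could be computed from it, is given below. The helper names are hypothetical, and the example uses only the first line of the gene sequence, so its GC fraction is not the whole-gene value reported in the record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as listed in this record.
first_line = "GTGTTAGGCT TCATTTCCAA TCAAATAACA TCCGCTATCA TTGACGGCAC ATCGGTTGTC"
seq = join_blocks(first_line)
print(len(seq))                    # 60
print(seq[:3])                     # GTG
print(round(gc_fraction(seq), 2))  # GC fraction of this first line only
```

The gene starts with GTG, an alternative initiation codon under translation table 11, which is consistent with the protein sequence starting with M.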