Gene Msed_0911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0911 
Symbol 
ID5103557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp842213 
End bp844198 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content49% 
IMG OID640506814 
ProductAlpha-glucosidase 
Protein accessionYP_001191007 
Protein GI146303691 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.22141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAAGG CTTTTGAGAA GGAAGGAACT TACCAGTTCC TTATAAACGA TCCCTTCCCT 
CCAGTTGATT TCCAATTTCA GGGAAAGCTT AGCTACAAAT CCCTGCAGGA TTTCGATCTC
GAGCTCGAGG AGGAGGAAGG GCTAACCCTC ATCAAACCCC TAGGCATCAA GGATCACGTT
TTAGGACACG GAGAGAAGGC CTTTGAACTG GATAGAAAAC GAGGTAAATA CGTGATGTAT
AACGTCGATG CTGGGGCATA TCACAAGTAC TCCGACCCCA TGTACGTGAA CATTCCCTTC
ATGATAGTGG TGAGAGGAGG AGTTGCCACA GGCTATTTCG TCAACTCGGC CTCCAGGTTA
GTCTTTGACG TGGGAAGGGA TCACTACGAC GAAATAAGGA TAACCATACC CGAGGATTAC
GTGGAACTCT ACGTCTTTGA GGGACCTAGA ATAGAACAGG TGATCGAGAG ATATGTATCC
CTCACGGGGT TACCCTTCCT TCCCCCTGAG TGGGCCCTTG GTTACATGAT ATCGAGGTAC
TCATATTACC CTGACACCCA CATCCTAGAA CTTCTTGACC TACTCAGGAA GGATGGGTTT
CCCGTGTCGG CAATCTTCCT GGACATCGAT TTCATGGATC AATTCAAGTT GTTTACCTGG
CACCCCAAGA GGTTCCCTGA CCCGAAGAAA TTCCTTGAGG AAGTTCACTC GAGAGGGGTC
AAGGTAATCA CGATCGTGGA CCACAGCGTG AGGGCAGACC AGAACTACGA GGTATTCAAG
TTGGGACTCG GTAAGTACTG TGAGACGGAG AACGGAGACC TGTTCGTGGG GAAACTGTGG
CCAGGTAACT GTGTGTACCC TGACTTCTTC CGAGAGGATG CCCGAGGGTG GTGGGCAGAA
CTAGTAAGGG AGTGGGTGGA GCAGGGGATA GACGGAATCT GGTTGGACAT GAACGAGCCA
ACGGACTTCA CGGAGCTTTT CAGGTTGAGG CAAGCCTGTA GGGACTTCCA GGTTAGGGAG
ACTCCTTTCT CCTACGTCTT TCCCGAGAAC GTGGTACACG TCCTGAAAGG GAGGAAAGTC
AAACACGGGA AGGTCAGGAA CGCGTATCCC TATTATGAGG CCATGGCCAC CTTTGATGGG
GTGTCGAGGG CGAGAAGGGA GATGTTTATC TTAAGTAGGT CGGGCTACGC AGGGATCCAA
AGATACGCGG GAATCTGGAC AGGGGATAAT ACCGCCTCCT GGAATCAGCT GAAGTTACAG
CTACAGCTAG TTTTAGGGCT TTCGATGTCC GGTGTACCTT ACGTCGGAAT GGACATAGGA
GGATTTCAGG GTAGAGAATT TCCAGAGATC GAGAACTCGC CTGAGATGCT CGTTAGGCAC
TTTCAGTTGG CGATGTTCTT TCCCTTCTTC AGGACGCATA AGAGCAAGGA TGGAGTAGAT
AGTGAACCTG TGTTCCTACC TAGCATGTAC AAGGAGAAGG TGAAGAGGGT CATGGAGACC
AGAAAAATGT TCCTGCCCTA CCTTTACGCA CTGGCTGAGG AGGCGCATAG AACTGGACAT
CCCATAATCA GGCCTCTCTT TTACGAGTAT CAGGAAGACG AGGACACCTA CAGGATCGAT
GACGAGTATC TAGTGGGGAA ATTCCTATTG TATGCCCCTC TCATGGGGAG AGAGGATAGC
AGGGACGTGT ACCTTCCTGA AAAGTGGGCA GACTTCTGGA CTGGGGAAGT AATGCAGGGA
TGGGTAAGAT CCAAAGATGA GTTGCCAATC TACGTTAGGG AGGGAGCCAT AATTCCCCTG
TCAGATCATG GACTGTTAGT GTATGGAAAC GGGGAGTACG AGTATTGGGG AACCAAGATA
GTTTCCACAG ACAACATAGT TACATTCTCA CCTCCAGTCT ACATCAAGTC GTTGATACGT
ATTGATGAAA ACGGTAGAAG AATGATTAGT GTTAATTCCG AGGTTACAGC AATAAGGATT
AAGTAA
 
Protein sequence
MIKAFEKEGT YQFLINDPFP PVDFQFQGKL SYKSLQDFDL ELEEEEGLTL IKPLGIKDHV 
LGHGEKAFEL DRKRGKYVMY NVDAGAYHKY SDPMYVNIPF MIVVRGGVAT GYFVNSASRL
VFDVGRDHYD EIRITIPEDY VELYVFEGPR IEQVIERYVS LTGLPFLPPE WALGYMISRY
SYYPDTHILE LLDLLRKDGF PVSAIFLDID FMDQFKLFTW HPKRFPDPKK FLEEVHSRGV
KVITIVDHSV RADQNYEVFK LGLGKYCETE NGDLFVGKLW PGNCVYPDFF REDARGWWAE
LVREWVEQGI DGIWLDMNEP TDFTELFRLR QACRDFQVRE TPFSYVFPEN VVHVLKGRKV
KHGKVRNAYP YYEAMATFDG VSRARREMFI LSRSGYAGIQ RYAGIWTGDN TASWNQLKLQ
LQLVLGLSMS GVPYVGMDIG GFQGREFPEI ENSPEMLVRH FQLAMFFPFF RTHKSKDGVD
SEPVFLPSMY KEKVKRVMET RKMFLPYLYA LAEEAHRTGH PIIRPLFYEY QEDEDTYRID
DEYLVGKFLL YAPLMGREDS RDVYLPEKWA DFWTGEVMQG WVRSKDELPI YVREGAIIPL
SDHGLLVYGN GEYEYWGTKI VSTDNIVTFS PPVYIKSLIR IDENGRRMIS VNSEVTAIRI
K