Gene Msed_1421 details


Gene Information

Locus tag: Msed_1421
Symbol:
ID: 5104792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1388990
End bp: 1390825
Gene Length: 1836 bp
Protein Length: 611 aa
Translation table: 11
GC content: 51%
IMG OID: 640507310
Product: glycogen debranching enzyme, putative
Protein accession: YP_001191503
Protein GI: 146304187
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3408] Glycogen debranching enzyme
TIGRFAM ID: [TIGR01561] glycogen debranching enzyme, archaeal type, putative
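
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are inclusive and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1388990
end_bp = 1390825

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1836 611"
```

Both results agree with the listed Gene Length (1836 bp) and Protein Length (611 aa).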


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.569288
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0701501
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACC CGAAGGAATG CGAGGACAGG GAGTGGATAA TACCTACTGG GACCGGAGGT 
TATTCGTCCT CAACCTTCTG CGGAATAAAC TCCAGAACTT ATCACGGCCT GCTAGTTATA
CCGCAGGATC CACCTCACAG GAGATACATG ACCCTGGCCA AGGTGGAGGA TTTCGTTATA
ACTGACGGCC AAGAGTACCC CATGAGCACG AACCATTACC TGAACGACGT GTTTTATCCG
GAGGGGTACA GGTTCCTGAA TCACGTGGAG CGGGGAGAGA ACTTTGTTAG ATGGGACTTC
CTTTTTGGGA ATTCAAGGGT CGAGAGAACC CTGGTTGTGC ACAGGGGTTA CAACGCCATA
ACCCTGTCCT ACGCTTCCCA GAGGGGAGTT TTCAGGATAT GTCCCCTAGT CACGTACAGG
AGTCATCATG TGGCTCTGAA GTCGGTTCAC CCCATCTTCA CGTACAGGCT TCTTCAGGAC
CACATTCTCC TTCTCGCGAA TGGGATACCC TTCCTCAGGG TCAGGATAAG GGGAGACCAC
GTCCTCGATA AGACGGAGTA CTGGTACTAT AACTTCTTTT ACCGTTTAGA CTTCGAGAGG
GGAACCAATT ACCTGGAGGA CCTGTACAAT CCCTTCTGCG TGATCAGCAA GGGGAACAAG
ATTGAGATGG ACTTCTACTG GGGGGAATTT GAGCCCGAGC AGAAAAGGGT TGGGTCCAAA
GAGATCATGG ACCTTCTTTC AAGTGCGGGG AAAAGCTTCG TCGTGAGAAG CGGAGACAAG
TACGCGATCA TTGCAGGATA TCACTGGTTT GATGAGTGGG GAAGGGATAC CATGATCTCC
ATGGAGGGGA TCCTGCTCAT GAACGGGTTG TATGAACAGG CCAAGAGCAT CCTCTTGAGG
TATTTCAATG CAGTTAACAG GGGCCTGATG CCCAATAACT TCCTGGGAAA CAACGAGACC
GCCTACAAGG GAGTCGACGT TTCGCTATGG GGAATCAACG CCGTGTACAA GTACTATCAG
TACACGAATG ACGTTGAGTT CCTGAAAAGG ATATTCCCAA GAATGCTGGA GGTCGTGGAC
TCTTACTGGA AAGGAAACGG AGTTGTGGTG AACAAGGACA ACCTCTTGTA TCACGTTGGA
GCACCTAGGA CTTGGATGGA CGCTCAGTTT GACGGTGAGG TCGTGACTCC AAGGGAAGGA
GCAGCTGTCG AGATCAATGC CCTATGGTAC AACGCGTTAA TGATCATGGA CCAGATCTCC
AAGAGGTTGG GAATACATGA CGACGAGTTC GTAGAGAAGG CCGAAAAGGT GAGGTCGGCG
TTCCTGGAGA AGTTTCCTTC GGAGGCTGGG CTATATGACT ACATTGGATG GGACGATAAG
CCGGGGAAGG AGATTAGACC CAATCAGCTG GTTGCTCTTG GCCTTCCTTA CCCTGTGGTC
TCCAAGGATA TCGCCATGAG GGTACTGGAG GTGGTGGAGA CGGAACTGTT GAGGCCATAC
GGGTTGAGCA CCCTCTCCAA GCGGGATAAA GGTTACACAC CCTTTTACAG GGGCGATAGG
GCCAGTAGAG ACAGAGCGTA TCATAACGGC CCGATATGGC CATGGCTCGT GGGAATCTAC
GTTGATGCTA AGCTCAACTT TGAATACGAT TCCCTCAGAA TCAAGAACCT GCTGAACCAA
TTCAGTCCCC TTCTAGGAGT GGCCGTGAGG GAAAATGGAT ACGTTCCTGA GCTCTTTGAG
GATATTCCTC CCTACAAGAA GGGCGGATGT ATTGCTCAAG CTTGGAGTGT CGCAGAATTG
AACAGGGCAA TTAGAAATAT CATCAATTAC TCGTGA
 
Protein sequence
MLDPKECEDR EWIIPTGTGG YSSSTFCGIN SRTYHGLLVI PQDPPHRRYM TLAKVEDFVI 
TDGQEYPMST NHYLNDVFYP EGYRFLNHVE RGENFVRWDF LFGNSRVERT LVVHRGYNAI
TLSYASQRGV FRICPLVTYR SHHVALKSVH PIFTYRLLQD HILLLANGIP FLRVRIRGDH
VLDKTEYWYY NFFYRLDFER GTNYLEDLYN PFCVISKGNK IEMDFYWGEF EPEQKRVGSK
EIMDLLSSAG KSFVVRSGDK YAIIAGYHWF DEWGRDTMIS MEGILLMNGL YEQAKSILLR
YFNAVNRGLM PNNFLGNNET AYKGVDVSLW GINAVYKYYQ YTNDVEFLKR IFPRMLEVVD
SYWKGNGVVV NKDNLLYHVG APRTWMDAQF DGEVVTPREG AAVEINALWY NALMIMDQIS
KRLGIHDDEF VEKAEKVRSA FLEKFPSEAG LYDYIGWDDK PGKEIRPNQL VALGLPYPVV
SKDIAMRVLE VVETELLRPY GLSTLSKRDK GYTPFYRGDR ASRDRAYHNG PIWPWLVGIY
VDAKLNFEYD SLRIKNLLNQ FSPLLGVAVR ENGYVPELFE DIPPYKKGGC IAQAWSVAEL
NRAIRNIINY S
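
The sequences above are printed in space-separated blocks of ten residues. A minimal sketch for joining those blocks into one string and computing a GC fraction is shown below. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input, so the printed fraction describes that line alone and is not the gene-wide GC content listed in the record.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used as a short sample input.
first_line = "ATGCTCGACC CGAAGGAATG CGAGGACAGG GAGTGGATAA TACCTACTGG GACCGGAGGT"
dna = join_blocks(first_line)

print(len(dna))                    # number of bases in the sample line
print(round(gc_fraction(dna), 2))  # GC fraction of the sample line only
```

Passing the full gene sequence to the same helpers would allow a comparison with the Gene Length and GC content fields.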