Gene Msed_1420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1420 
Symbol 
ID5104791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1387138 
End bp1388985 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content49% 
IMG OID640507309 
Productglycoside hydrolase 15-related 
Protein accessionYP_001191502 
Protein GI146304186 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0134624 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTTG CCAGCATTGG AAACGGAAAA ATGCTAGTGA ACTTTGATGA CCACGGGAGG 
ATTATTGACC TCTATTACCC CTATATAGGA ATGGAGAACC AAACGTCTGG TATTCCCATA
AGGGTTGCGC TCTGGGATGG GAAGAACGTT TATCTGGATG AGTCATGGAA GACCGAGGTC
TCGTACGAGG ACGGAACCAA TCTTGTGGAG GTCAAGTGGA CCCTAGATAA CCCAGGACTT
GAAATAACTT CCTACAACTT CGTCGACGTG AATGAACCTG TGATGAATTC CATAATAAAG
ATACTATCCA GGGATATTGA GGGAAAGCTC AGGCTCTTCT TCGTTCACGA CCTAAACATT
TATTCCAATC CCTTTGGAGA TACTGCACTT CTGGACCCTG TTACCTGGTC CATGATTCAC
TACAAGTCCA AGAGGTACCT AGGAATCAAG CTCATGTCCA CTGAAATGAA CAACACGGAA
TTCTCCGCAA CCAAGGGCGA TCCCTTGGAG GATATAAAGG ACGGGAGATT GGATGGTAGC
CCGATCTCTC ACGGAGACGT GAAGTCCGCG GTGGGGGTGG AACTAAACCT CAGGAGTAAA
TCCTTCGTGA AGGCCTATTA CGTAATAGGG GCAGCGAGGA ATCTCGAGGA GTTGAGGAGA
CTTCTCGGTG AGGCGAACCC AGCCAAGATA GAGAGCAACT TCGTCTCAGT GTTCCAGTTC
TGGAAGAGCT GGCTATCTAA GGGTAGCTGG ACATCTGATC ACGAGAGCTG GATATACAAC
GTTAGTCTCC TGACCGTGAA GAATCACATG GACATGAATG GATCGATCAT AGCCTCCTCA
GATTTCTCCT TTGTGAACAT CTATGGGGAC TCTTATCAAT ACTTCTGGCC TAGAGACGGG
GCCATAGCTG CGCACTCGCT GGATGTTGCG GGATATGGAG AACTGGCCAT GAAGCACTTC
AACTTTGTGA AGGAGATTGC AAATCCCGAG GGTTATCTAC ACCACAAGTA CAACCCCAAC
AGGACGCTTG CAAGCTCGTG GCACCCCTGG CTCTATAACG GGAAGAGGAT CCTGCCAATA
CAGGAGGACG AAACTGCTCT CGAGGTCTGG GCCATAGGAA GTCATTACAG GAGGTATAAG
GACTTGGACG AACTCACAGA GATTTACAGG AAGTTTGTGA AACCAGCTCT ACAGTTCATG
ATGAGGTACA CCGAGGACGG ACTTCCAAAA CCGAGCTTTG ATCTGTGGGA GGAGAGGTAT
GGAATTCACC TCTACACTGT GTCAACGGTG TATGGAGGGT TGGTCATGGG AGCAGAACTC
GCCAAGGGAA TGGGAGACGA AAGCCTTTCA GAAGACGCTC TAGACGTGGC CAAGACCATG
AAGGAACAGG CCCTTTCCAG GTTGACCAAT GGGAGGAGAT TCATCAGGAG GCTAGACGAG
AACTATCAGC CCGATCAGGT TGTGGACGCA AGCATGTATG CCCCGTACTA CTTTGGGATG
GTGGAACCAA ATCATCCCAT TATGATCTCC ACCATGGAGG CCATAGAACA GAGGTTAATG
ATAAACGGTG GAATCGCGAG ATACGAGAAC GACATGTACC AGAGGAGAAA GGCTCAGCCC
AATCCCTGGA TAATCACAAC CCTATGGGTT GCTCAATACA TGATAGATAC TTCCAGGCTG
GATAAGGCCA AGGATCTCCT GACCTGGGTT ATGAAGAGGG CAACCCCCTC TGGTTTCCTT
CCTGAACAGG TTGACCCAGA GACGTGGGAG TCTACCTCCG TCATCCCCCT TGTGTGGTCG
CATGCGGAAC TAATAATTAC ATTAAATAAA TACCACGGCA AATACTAA
 
Protein sequence
MRLASIGNGK MLVNFDDHGR IIDLYYPYIG MENQTSGIPI RVALWDGKNV YLDESWKTEV 
SYEDGTNLVE VKWTLDNPGL EITSYNFVDV NEPVMNSIIK ILSRDIEGKL RLFFVHDLNI
YSNPFGDTAL LDPVTWSMIH YKSKRYLGIK LMSTEMNNTE FSATKGDPLE DIKDGRLDGS
PISHGDVKSA VGVELNLRSK SFVKAYYVIG AARNLEELRR LLGEANPAKI ESNFVSVFQF
WKSWLSKGSW TSDHESWIYN VSLLTVKNHM DMNGSIIASS DFSFVNIYGD SYQYFWPRDG
AIAAHSLDVA GYGELAMKHF NFVKEIANPE GYLHHKYNPN RTLASSWHPW LYNGKRILPI
QEDETALEVW AIGSHYRRYK DLDELTEIYR KFVKPALQFM MRYTEDGLPK PSFDLWEERY
GIHLYTVSTV YGGLVMGAEL AKGMGDESLS EDALDVAKTM KEQALSRLTN GRRFIRRLDE
NYQPDQVVDA SMYAPYYFGM VEPNHPIMIS TMEAIEQRLM INGGIARYEN DMYQRRKAQP
NPWIITTLWV AQYMIDTSRL DKAKDLLTWV MKRATPSGFL PEQVDPETWE STSVIPLVWS
HAELIITLNK YHGKY