Gene Information |
Locus tag | Msed_1420 |
Symbol | |
ID | 5104791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1387138 |
End bp | 1388985 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507309 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_001191502 |
Protein GI | 146304186 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
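The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. Neither assumption is stated in the record, but both are consistent with the values it reports. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1387138
end_bp = 1388985

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1848, matches "Gene Length | 1848 bp"
print(protein_length)   # 615, matches "Protein Length | 615 aa"
```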
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0134624 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACTTG CCAGCATTGG AAACGGAAAA ATGCTAGTGA ACTTTGATGA CCACGGGAGG ATTATTGACC TCTATTACCC CTATATAGGA ATGGAGAACC AAACGTCTGG TATTCCCATA AGGGTTGCGC TCTGGGATGG GAAGAACGTT TATCTGGATG AGTCATGGAA GACCGAGGTC TCGTACGAGG ACGGAACCAA TCTTGTGGAG GTCAAGTGGA CCCTAGATAA CCCAGGACTT GAAATAACTT CCTACAACTT CGTCGACGTG AATGAACCTG TGATGAATTC CATAATAAAG ATACTATCCA GGGATATTGA GGGAAAGCTC AGGCTCTTCT TCGTTCACGA CCTAAACATT TATTCCAATC CCTTTGGAGA TACTGCACTT CTGGACCCTG TTACCTGGTC CATGATTCAC TACAAGTCCA AGAGGTACCT AGGAATCAAG CTCATGTCCA CTGAAATGAA CAACACGGAA TTCTCCGCAA CCAAGGGCGA TCCCTTGGAG GATATAAAGG ACGGGAGATT GGATGGTAGC CCGATCTCTC ACGGAGACGT GAAGTCCGCG GTGGGGGTGG AACTAAACCT CAGGAGTAAA TCCTTCGTGA AGGCCTATTA CGTAATAGGG GCAGCGAGGA ATCTCGAGGA GTTGAGGAGA CTTCTCGGTG AGGCGAACCC AGCCAAGATA GAGAGCAACT TCGTCTCAGT GTTCCAGTTC TGGAAGAGCT GGCTATCTAA GGGTAGCTGG ACATCTGATC ACGAGAGCTG GATATACAAC GTTAGTCTCC TGACCGTGAA GAATCACATG GACATGAATG GATCGATCAT AGCCTCCTCA GATTTCTCCT TTGTGAACAT CTATGGGGAC TCTTATCAAT ACTTCTGGCC TAGAGACGGG GCCATAGCTG CGCACTCGCT GGATGTTGCG GGATATGGAG AACTGGCCAT GAAGCACTTC AACTTTGTGA AGGAGATTGC AAATCCCGAG GGTTATCTAC ACCACAAGTA CAACCCCAAC AGGACGCTTG CAAGCTCGTG GCACCCCTGG CTCTATAACG GGAAGAGGAT CCTGCCAATA CAGGAGGACG AAACTGCTCT CGAGGTCTGG GCCATAGGAA GTCATTACAG GAGGTATAAG GACTTGGACG AACTCACAGA GATTTACAGG AAGTTTGTGA AACCAGCTCT ACAGTTCATG ATGAGGTACA CCGAGGACGG ACTTCCAAAA CCGAGCTTTG ATCTGTGGGA GGAGAGGTAT GGAATTCACC TCTACACTGT GTCAACGGTG TATGGAGGGT TGGTCATGGG AGCAGAACTC GCCAAGGGAA TGGGAGACGA AAGCCTTTCA GAAGACGCTC TAGACGTGGC CAAGACCATG AAGGAACAGG CCCTTTCCAG GTTGACCAAT GGGAGGAGAT TCATCAGGAG GCTAGACGAG AACTATCAGC CCGATCAGGT TGTGGACGCA AGCATGTATG CCCCGTACTA CTTTGGGATG GTGGAACCAA ATCATCCCAT TATGATCTCC ACCATGGAGG CCATAGAACA GAGGTTAATG ATAAACGGTG GAATCGCGAG ATACGAGAAC GACATGTACC AGAGGAGAAA GGCTCAGCCC AATCCCTGGA TAATCACAAC CCTATGGGTT GCTCAATACA TGATAGATAC TTCCAGGCTG GATAAGGCCA AGGATCTCCT GACCTGGGTT ATGAAGAGGG CAACCCCCTC TGGTTTCCTT CCTGAACAGG TTGACCCAGA GACGTGGGAG TCTACCTCCG TCATCCCCCT TGTGTGGTCG 
CATGCGGAAC TAATAATTAC ATTAAATAAA TACCACGGCA AATACTAA
|
Protein sequence | MRLASIGNGK MLVNFDDHGR IIDLYYPYIG MENQTSGIPI RVALWDGKNV YLDESWKTEV SYEDGTNLVE VKWTLDNPGL EITSYNFVDV NEPVMNSIIK ILSRDIEGKL RLFFVHDLNI YSNPFGDTAL LDPVTWSMIH YKSKRYLGIK LMSTEMNNTE FSATKGDPLE DIKDGRLDGS PISHGDVKSA VGVELNLRSK SFVKAYYVIG AARNLEELRR LLGEANPAKI ESNFVSVFQF WKSWLSKGSW TSDHESWIYN VSLLTVKNHM DMNGSIIASS DFSFVNIYGD SYQYFWPRDG AIAAHSLDVA GYGELAMKHF NFVKEIANPE GYLHHKYNPN RTLASSWHPW LYNGKRILPI QEDETALEVW AIGSHYRRYK DLDELTEIYR KFVKPALQFM MRYTEDGLPK PSFDLWEERY GIHLYTVSTV YGGLVMGAEL AKGMGDESLS EDALDVAKTM KEQALSRLTN GRRFIRRLDE NYQPDQVVDA SMYAPYYFGM VEPNHPIMIS TMEAIEQRLM INGGIARYEN DMYQRRKAQP NPWIITTLWV AQYMIDTSRL DKAKDLLTWV MKRATPSGFL PEQVDPETWE STSVIPLVWS HAELIITLNK YHGKY
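The relationship between the gene sequence and the protein sequence can be spot-checked with a short translation sketch. It assumes the 10-base blocks are separated by whitespace that must be stripped, and that translation table 11 assigns the same amino acid to each codon as the standard code (the two tables differ in their permitted start codons). The helper names `clean`, `translate` and `gc_percent` are hypothetical and not part of any database tool. Only the first and last blocks of the gene sequence are used here, and the last block is in frame because it begins 18 bases before the end of the 1848 bp sequence.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks between 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three and last two blocks of the gene sequence listed above.
first_blocks = "ATGAGACTTG CCAGCATTGG AAACGGAAAA"
last_blocks = "TACCACGGCA AATACTAA"

print(translate(first_blocks))  # MRLASIGNGK
print(translate(last_blocks))   # YHGKY*
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.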
|
| |