Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1418 |
Symbol | |
ID | 5104628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1384523 |
End bp | 1385857 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507307 |
Product | Alpha-amylase |
Protein accession | YP_001191500 |
Protein GI | 146304184 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1449] Alpha-amylase/alpha-mannosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00043111 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00411269 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTAGCA AGGTAATAAT GGGCTTTGAG GTTCATCAAC CCTTCAGAAT CAGGAAAGAC GCCTTCTGGA ATCCTAGGTT TAAGGGATCC CCGCAGGAGA GATACTTCGA CGATAAATTA AACAGGGAAA TATTTGAGCG CGTGAGGGCC AAGTGCTACA TTCCCGCAAC AAACATCATC CTTGAGGAGA TCGAGGCGGG AGAAGATGAG GGAAGGGAGG TCAAGTTCTT CTTCTCTGTG TCTGGGACCC TGTTGGAACA GGCTGAGAGA TGGGGGAGGG ATTTTCTAGA TTTACTCGAA TTGCTCTCGA GCACGCGAAA GGTCGAGTTC CTAGCTCAGA CTTACTATCA CTCCGTTACC TCGCTGTGGG AGGATAGGAC AGAGTGGAGG GAACAGGTAA AGCTTCACGT TGAAACAGTT AAGTCACTCT TAGGTCAGAC CCCAGTTACG TTCGAGAATA CCGAGCTACT CACTAGTCCC GTGATAGTGG AGGAGGCAGA GAACATGGGT TTCAATGGTA TCATGATGGA GGGAAAGGAC TCCGTGTTGA GGGGGAGATC ACCAAACTTC GTCTACAGGA GAAAGGGAGG TAAAATCTCG ATCCTACCAA GAAACTTCAC CTTGAGTGAT GACGTCGCAT TCAGGTTCTC CAATCCCAAC TGGGACCAGT ATCCCTTAAC CGCGGAGAAG TACTCCTCGT GGGTTAAGGC CTCCCCAGGT CAGGTAGTCA CTATTTTCGT GGATTATGAG ACATTTGGAG AGCATCACTG GAAGGAGAGT GGAATCCTGG AGTTCCTAAG ATGGTTGCCC AGGGAACTCA ACAGGGAGGG AGTGGAGATG ACCCTACCAA GGGAGGTAGA GGGCAGTCCC TACTATGACC TTGAGGTTAG CGGAATATCC TCATGGGCAG ACATCAGAAA GGATCACACA AGTTGGTTGG GTAACATAAT GCAGTGGGCC TACGACGAGG CAGTTAGGAG ATCTGAGATG ACCTCGAAGG AACTAGGAGG AGAATTTCTA AGGGCGTGGA GATACTTCAC CACGAGTGAT AACTACTACT ATTTGTTCAC TGAGGGTGGT GGTCCAGGCG AGGTTCACTC GTATTTCAAC GCTTATAATT CCCCGATAGA TGCCTTCCTA AACGAGTTCT ATGCCATTAA CTCCTTTCTT CATGACGAAC TTGAAAATCT AGGAATCAAG AATGAGCCTT TCTTCTTCTA CAAGGATGGG AAGAGAGTTG GGGTAGCTTG GGATGAGAAC CAGTTCATGG AAATAGTGAG GCGCGATGAA TCACTTAAGG ATCACCTGAA GTACTTGAAG GAGTGGTTGC AATGA
|
Protein sequence | MTSKVIMGFE VHQPFRIRKD AFWNPRFKGS PQERYFDDKL NREIFERVRA KCYIPATNII LEEIEAGEDE GREVKFFFSV SGTLLEQAER WGRDFLDLLE LLSSTRKVEF LAQTYYHSVT SLWEDRTEWR EQVKLHVETV KSLLGQTPVT FENTELLTSP VIVEEAENMG FNGIMMEGKD SVLRGRSPNF VYRRKGGKIS ILPRNFTLSD DVAFRFSNPN WDQYPLTAEK YSSWVKASPG QVVTIFVDYE TFGEHHWKES GILEFLRWLP RELNREGVEM TLPREVEGSP YYDLEVSGIS SWADIRKDHT SWLGNIMQWA YDEAVRRSEM TSKELGGEFL RAWRYFTTSD NYYYLFTEGG GPGEVHSYFN AYNSPIDAFL NEFYAINSFL HDELENLGIK NEPFFFYKDG KRVGVAWDEN QFMEIVRRDE SLKDHLKYLK EWLQ
|
| |