Gene Msed_1418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1418 
Symbol 
ID5104628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1384523 
End bp1385857 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content48% 
IMG OID640507307 
ProductAlpha-amylase 
Protein accessionYP_001191500 
Protein GI146304184 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00043111 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00411269 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTAGCA AGGTAATAAT GGGCTTTGAG GTTCATCAAC CCTTCAGAAT CAGGAAAGAC 
GCCTTCTGGA ATCCTAGGTT TAAGGGATCC CCGCAGGAGA GATACTTCGA CGATAAATTA
AACAGGGAAA TATTTGAGCG CGTGAGGGCC AAGTGCTACA TTCCCGCAAC AAACATCATC
CTTGAGGAGA TCGAGGCGGG AGAAGATGAG GGAAGGGAGG TCAAGTTCTT CTTCTCTGTG
TCTGGGACCC TGTTGGAACA GGCTGAGAGA TGGGGGAGGG ATTTTCTAGA TTTACTCGAA
TTGCTCTCGA GCACGCGAAA GGTCGAGTTC CTAGCTCAGA CTTACTATCA CTCCGTTACC
TCGCTGTGGG AGGATAGGAC AGAGTGGAGG GAACAGGTAA AGCTTCACGT TGAAACAGTT
AAGTCACTCT TAGGTCAGAC CCCAGTTACG TTCGAGAATA CCGAGCTACT CACTAGTCCC
GTGATAGTGG AGGAGGCAGA GAACATGGGT TTCAATGGTA TCATGATGGA GGGAAAGGAC
TCCGTGTTGA GGGGGAGATC ACCAAACTTC GTCTACAGGA GAAAGGGAGG TAAAATCTCG
ATCCTACCAA GAAACTTCAC CTTGAGTGAT GACGTCGCAT TCAGGTTCTC CAATCCCAAC
TGGGACCAGT ATCCCTTAAC CGCGGAGAAG TACTCCTCGT GGGTTAAGGC CTCCCCAGGT
CAGGTAGTCA CTATTTTCGT GGATTATGAG ACATTTGGAG AGCATCACTG GAAGGAGAGT
GGAATCCTGG AGTTCCTAAG ATGGTTGCCC AGGGAACTCA ACAGGGAGGG AGTGGAGATG
ACCCTACCAA GGGAGGTAGA GGGCAGTCCC TACTATGACC TTGAGGTTAG CGGAATATCC
TCATGGGCAG ACATCAGAAA GGATCACACA AGTTGGTTGG GTAACATAAT GCAGTGGGCC
TACGACGAGG CAGTTAGGAG ATCTGAGATG ACCTCGAAGG AACTAGGAGG AGAATTTCTA
AGGGCGTGGA GATACTTCAC CACGAGTGAT AACTACTACT ATTTGTTCAC TGAGGGTGGT
GGTCCAGGCG AGGTTCACTC GTATTTCAAC GCTTATAATT CCCCGATAGA TGCCTTCCTA
AACGAGTTCT ATGCCATTAA CTCCTTTCTT CATGACGAAC TTGAAAATCT AGGAATCAAG
AATGAGCCTT TCTTCTTCTA CAAGGATGGG AAGAGAGTTG GGGTAGCTTG GGATGAGAAC
CAGTTCATGG AAATAGTGAG GCGCGATGAA TCACTTAAGG ATCACCTGAA GTACTTGAAG
GAGTGGTTGC AATGA
 
Protein sequence
MTSKVIMGFE VHQPFRIRKD AFWNPRFKGS PQERYFDDKL NREIFERVRA KCYIPATNII 
LEEIEAGEDE GREVKFFFSV SGTLLEQAER WGRDFLDLLE LLSSTRKVEF LAQTYYHSVT
SLWEDRTEWR EQVKLHVETV KSLLGQTPVT FENTELLTSP VIVEEAENMG FNGIMMEGKD
SVLRGRSPNF VYRRKGGKIS ILPRNFTLSD DVAFRFSNPN WDQYPLTAEK YSSWVKASPG
QVVTIFVDYE TFGEHHWKES GILEFLRWLP RELNREGVEM TLPREVEGSP YYDLEVSGIS
SWADIRKDHT SWLGNIMQWA YDEAVRRSEM TSKELGGEFL RAWRYFTTSD NYYYLFTEGG
GPGEVHSYFN AYNSPIDAFL NEFYAINSFL HDELENLGIK NEPFFFYKDG KRVGVAWDEN
QFMEIVRRDE SLKDHLKYLK EWLQ