Gene Information |
Locus tag | Msed_1124 |
Symbol | |
ID | 5103596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1053649 |
End bp | 1054809 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507017 |
Product | D-galactarate dehydratase/Altronate hydrolase domain-containing protein |
Protein accession | YP_001191210 |
Protein GI | 146303894 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2721] Altronate dehydratase |
TIGRFAM ID | |
|
|
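The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1053649
END_BP = 1054809
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    # Compare the derived values with the ones reported in the record.
    print(length == GENE_LENGTH_BP, protein_length(length) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1054809 - 1053649 + 1 gives 1161 bp, and 1161 / 3 - 1 gives 386 residues, matching the reported values.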
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACAA TCAAGGGTTA CATAAGGGAG AATGGAGCTG TTGGCGTAAG GAATCACGTC CTGGTTCTTC CCTTGGATGA CCTTTCCAAT TCCGCAGCCT TGGGGGTTTC CAAGATAGTT AACGGTGTCG TGGCTGTTCC TCACCCCTAC GGTAGGTTAC AGTTTGGTAG AGATCTTGAC CTCCTATTTC ACATCCTTTC AGGGACCGGG GCGAACCCAA ACGTCGCTGG GGTCATCGTA ATAGGGATTG AGGACAATTG GGCCAATAGG GTGGCAGACG GTATCGCCAA GACAGGTAAA CCCGTTGAGG TCTTCCCCAT TGAGGGATAC GGTGACCTAA AGACCATTGA GAGGGCCTCA AGGAAGGCCA AGGAGATGGT TCAGGAGGCA AGCGAGAAAC AGCGCACAGA GGTGGACATT TCTTCCATTG TTATGAGCGT TAAGTGCGGG GAATCTGACA CTACCTCGGG TTTAGCATCT AACCCCTCCG TCGGGGTCGT GGTGGATAAG ATGGTTGACC TGGGAGCAGT TGCCATGTTT GGCGAAACCT CAGAGCTTAC GGGTGCAGAG GACATCGTAG CTGACAAGAT GGCCAACGAA GCCTTAAGGG AAAAGTTCCT GAAGATCTAT AGGGAGTACA TTGACGTGAT AGAAAGGGAA GGTGCGGATC TCCTTGGATC CCAGCCCACC CAAGGAAACA TTAAGGGAGG ACTCTCCACG ATAGAGGAGA AAGCGCTAGG GAACATTCAA AAGCTCGGAC ATAGGAAGGT TAACTGCGTC CTTGATTACC TAGATCCTCT GGTTAGGGAG AAGCAAGGTA CCCTATGTTT CGTGAACACC TCATCAGCGG CTGCCGAGGC GGTGACGTTG TTCGCCGCTA AGGGATCAGT GCTCCACCTG TTCACCACGG GTCAAGGAAA TATTGTGGGT CACCCCTTAA TACCTGTGAT AAAGATAACT GGCAATCCCA AGACGGCTAG AACCATGAGT GAGCATATAG ATGTGGACGT TTCGGATCTG CTAGACCTCA AGATCTCGCT AGAGGAGGCT GGAGAGAGGG TGTTCAATTA CATGCTTAGG GTCATGAACG GAAGGTTAAC TGCCGCCGAG GTACTTCACC ATGAGGAGTT CTCGCCGATA AAACTATACA TAAGTGCATA A
|
Protein sequence | MMTIKGYIRE NGAVGVRNHV LVLPLDDLSN SAALGVSKIV NGVVAVPHPY GRLQFGRDLD LLFHILSGTG ANPNVAGVIV IGIEDNWANR VADGIAKTGK PVEVFPIEGY GDLKTIERAS RKAKEMVQEA SEKQRTEVDI SSIVMSVKCG ESDTTSGLAS NPSVGVVVDK MVDLGAVAMF GETSELTGAE DIVADKMANE ALREKFLKIY REYIDVIERE GADLLGSQPT QGNIKGGLST IEEKALGNIQ KLGHRKVNCV LDYLDPLVRE KQGTLCFVNT SSAAAEAVTL FAAKGSVLHL FTTGQGNIVG HPLIPVIKIT GNPKTARTMS EHIDVDVSDL LDLKISLEEA GERVFNYMLR VMNGRLTAAE VLHHEEFSPI KLYISA
|
| |
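The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be translated and its GC content computed. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first 30 bases of the record are used as sample input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence in this record.
PREFIX = "ATGATGACAA TCAAGGGTTA CATAAGGGAG"

if __name__ == "__main__":
    print(translate(PREFIX))
    print(gc_percent(PREFIX))
```

The first ten codons translate to `MMTIKGYIRE`, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` would allow a comparison with the GC content field reported in the record.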