Gene Msed_1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1124 
Symbol 
ID5103596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1053649 
End bp1054809 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content51% 
IMG OID640507017 
ProductD-galactarate dehydratase/Altronate hydrolase domain-containing protein 
Protein accessionYP_001191210 
Protein GI146303894 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2721] Altronate dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACAA TCAAGGGTTA CATAAGGGAG AATGGAGCTG TTGGCGTAAG GAATCACGTC 
CTGGTTCTTC CCTTGGATGA CCTTTCCAAT TCCGCAGCCT TGGGGGTTTC CAAGATAGTT
AACGGTGTCG TGGCTGTTCC TCACCCCTAC GGTAGGTTAC AGTTTGGTAG AGATCTTGAC
CTCCTATTTC ACATCCTTTC AGGGACCGGG GCGAACCCAA ACGTCGCTGG GGTCATCGTA
ATAGGGATTG AGGACAATTG GGCCAATAGG GTGGCAGACG GTATCGCCAA GACAGGTAAA
CCCGTTGAGG TCTTCCCCAT TGAGGGATAC GGTGACCTAA AGACCATTGA GAGGGCCTCA
AGGAAGGCCA AGGAGATGGT TCAGGAGGCA AGCGAGAAAC AGCGCACAGA GGTGGACATT
TCTTCCATTG TTATGAGCGT TAAGTGCGGG GAATCTGACA CTACCTCGGG TTTAGCATCT
AACCCCTCCG TCGGGGTCGT GGTGGATAAG ATGGTTGACC TGGGAGCAGT TGCCATGTTT
GGCGAAACCT CAGAGCTTAC GGGTGCAGAG GACATCGTAG CTGACAAGAT GGCCAACGAA
GCCTTAAGGG AAAAGTTCCT GAAGATCTAT AGGGAGTACA TTGACGTGAT AGAAAGGGAA
GGTGCGGATC TCCTTGGATC CCAGCCCACC CAAGGAAACA TTAAGGGAGG ACTCTCCACG
ATAGAGGAGA AAGCGCTAGG GAACATTCAA AAGCTCGGAC ATAGGAAGGT TAACTGCGTC
CTTGATTACC TAGATCCTCT GGTTAGGGAG AAGCAAGGTA CCCTATGTTT CGTGAACACC
TCATCAGCGG CTGCCGAGGC GGTGACGTTG TTCGCCGCTA AGGGATCAGT GCTCCACCTG
TTCACCACGG GTCAAGGAAA TATTGTGGGT CACCCCTTAA TACCTGTGAT AAAGATAACT
GGCAATCCCA AGACGGCTAG AACCATGAGT GAGCATATAG ATGTGGACGT TTCGGATCTG
CTAGACCTCA AGATCTCGCT AGAGGAGGCT GGAGAGAGGG TGTTCAATTA CATGCTTAGG
GTCATGAACG GAAGGTTAAC TGCCGCCGAG GTACTTCACC ATGAGGAGTT CTCGCCGATA
AAACTATACA TAAGTGCATA A
 
Protein sequence
MMTIKGYIRE NGAVGVRNHV LVLPLDDLSN SAALGVSKIV NGVVAVPHPY GRLQFGRDLD 
LLFHILSGTG ANPNVAGVIV IGIEDNWANR VADGIAKTGK PVEVFPIEGY GDLKTIERAS
RKAKEMVQEA SEKQRTEVDI SSIVMSVKCG ESDTTSGLAS NPSVGVVVDK MVDLGAVAMF
GETSELTGAE DIVADKMANE ALREKFLKIY REYIDVIERE GADLLGSQPT QGNIKGGLST
IEEKALGNIQ KLGHRKVNCV LDYLDPLVRE KQGTLCFVNT SSAAAEAVTL FAAKGSVLHL
FTTGQGNIVG HPLIPVIKIT GNPKTARTMS EHIDVDVSDL LDLKISLEEA GERVFNYMLR
VMNGRLTAAE VLHHEEFSPI KLYISA