Gene Msed_1296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1296 
Symbol 
ID5104707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1273931 
End bp1274800 
Gene Length870 bp 
Protein Length289 aa 
Translation table11 
GC content47% 
IMG OID640507185 
Product2-keto-3-deoxygalactonate aldolase / 2-keto-3-deoxy-phosphogalactonate aldolase / 2-keto-3-deoxy-phosphogluconate aldolase / 2-keto-3-deoxygluconate aldolase 
Protein accessionYP_001191378 
Protein GI146304062 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTCG TTGTTCCCAT CCTAACGCCT TTCAATCCAG ACGGGACCAT CAACAGGGAA 
GCCTTGAAGA CCCACGCCAC CAATCTTCTT GAAAAGGGGA TAGATCTAAT CTTCTTGAAC
GGAACGACAG GGTTAGGACC AGCCCTCTCG AAGGAGGAAA AGAAGGAAAC CCTTAGAACC
CTTTCAGACG TGGCTGACAG GGTCATATTC CAGGTAGGAG AGCTTAACTT AAACAATGTG
CTGGAACTTG TTAAGTTCTC CTCTGATTAC GGTGTAAGGG CTATAGCGTC TTACTCCCCC
TACTATTTCC CGAGGCTTCC AGAGAAATGG TTAATCAAGT ATTTTCAGAC TATTGCCTCC
CATTCCTCCC ATCCAGTGTA CCTTTACAAC TACCCGTTGG CCACAGGTTA TGATATATCA
GCTGAACTAC TTTCCAAGTT TGGCCTTGAG TTGGCAGGGG TAAAGGACAC GAATCAGGAC
CTGGCACATT CCATGAAATT CAAAACAACT TTCCCTAAGA TGAAGGTGTA CAACGGTTCT
GATACCCTAG CGTTTTACTC CTTGTTATCA CTCGATGGGA CGGTAGCCTC AATGTCCAAC
TGTCTCCCCA CAGTTTTCGT TGAGATGAAG AGGGCAATCT CACAGGGAGA CGTGAAGAAG
GCCTTAGTTC ACCAGAGACT GATTACCTCC ATTGTGGAGC TAGCAAGGAA ATATGGACAA
CTTGGTGCAC TTTACGTCCT AACTGAAATG ACGCAGGGAT ATTCGGTTGG GAGACCAAGA
CCGCCCATAT TTCCCCTTGA GGAAGGCGAG GAGAGGGAAC TCAAAAAAGA GGTAGAGAAC
TTCATCAAGG GTCTCGGTGT TAAAGCTTGA
 
Protein sequence
MEVVVPILTP FNPDGTINRE ALKTHATNLL EKGIDLIFLN GTTGLGPALS KEEKKETLRT 
LSDVADRVIF QVGELNLNNV LELVKFSSDY GVRAIASYSP YYFPRLPEKW LIKYFQTIAS
HSSHPVYLYN YPLATGYDIS AELLSKFGLE LAGVKDTNQD LAHSMKFKTT FPKMKVYNGS
DTLAFYSLLS LDGTVASMSN CLPTVFVEMK RAISQGDVKK ALVHQRLITS IVELARKYGQ
LGALYVLTEM TQGYSVGRPR PPIFPLEEGE ERELKKEVEN FIKGLGVKA