Gene Msed_1876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1876 
Symbol 
ID5104144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1819129 
End bp1820121 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content55% 
IMG OID640507762 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionYP_001191940 
Protein GI146304624 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCTCT ACATATTGAA GGGCAAGGAT CACTCCTCAC TAAAGGAGAA ACTTAAGTCT 
AGCTCAGCCT CATTCAAGTT CCTCAACCTG TACGGAAAGG AGCTCGCACT GGCCTGGCCC
GACTCTGCAG TTGAGGGGAT AACGGATGAA TCCGTGGAGC TAGTGGTGAA GACCAAGAAG
TCCTACATCC TGGCTGGAAA CGAATGGAAA AAGGATCCAA CAGTGGTGAA GGTGAAGGAC
GTGGAGATAG GATCCAAGAG GGTTGTGGTC GCTGCCGGGC CATGTGCGGT GGAGTCCATG
GAGCAGACCG AGACCGTGGC CAAGGCCGTG AAAAGGGCTG GTGCCTCACT ACTCAGGGGT
GGAGCCTACA AGCCTAGGAC GAGCCCCTAC TCCTTCCAGG GACTGGGAGA GGAAGGGCTA
AAGATACTCA GGAAGGCTGG CGACGAGACA GGGTTACCTG TGGTTTCAGA GATCCTTGAC
GCGAGGGACG CGGGAGCCTT TGCAAAGTAT GCTGACATGG TCCAGATAGG CGCTAGGAAC
TCGCAGAACT TCACCCTTTT GCGGGATGTG GGAAAGCTGG GCAAACCCGT CTTGCTAAAG
AGAGGTCTAG GGAACACGGT GGAGGAACTA ATACAGTCTG CGGAATACGT AATGATGGAG
GGGAACGGCA ACGTGGTCCT CTGCGAAAGG GGAATAAGAA CCTTTGAGAA GTCCACCAGG
TTCACCCTAG ATATTGGAGG AATGGTTGCG GGGAAGCTAA TGACGCACCT CCCCTTCTGC
GCGGATCCGA GTCATCCTGC GGGGAAGAGG GAACTGGTTC ACTCCCTTGC CCTAGCCTCT
GTGGCAGCCG GGGCAGACAT GCTCCTTGTG GAAGTTCATC CCAGGCCTGA GGTGGCACTG
AGCGACTCTG AACAGCAACT GACCCCGGAG TCCTTTGAAT TATTGATGGA GAGAGTCAAG
GCATTGGCTT CGGTTCTAGG TAGATCCGCA TGA
 
Protein sequence
MILYILKGKD HSSLKEKLKS SSASFKFLNL YGKELALAWP DSAVEGITDE SVELVVKTKK 
SYILAGNEWK KDPTVVKVKD VEIGSKRVVV AAGPCAVESM EQTETVAKAV KRAGASLLRG
GAYKPRTSPY SFQGLGEEGL KILRKAGDET GLPVVSEILD ARDAGAFAKY ADMVQIGARN
SQNFTLLRDV GKLGKPVLLK RGLGNTVEEL IQSAEYVMME GNGNVVLCER GIRTFEKSTR
FTLDIGGMVA GKLMTHLPFC ADPSHPAGKR ELVHSLALAS VAAGADMLLV EVHPRPEVAL
SDSEQQLTPE SFELLMERVK ALASVLGRSA