Gene Msed_1875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1875 
SymbolaroB 
ID5104143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1818086 
End bp1819132 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content52% 
IMG OID640507761 
Product3-dehydroquinate synthase 
Protein accessionYP_001191939 
Protein GI146304623 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.722069 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGT TCCGTGAGAA GGTGTGCTGT TCAGATGTGA GCGTGAAAGT TGGTAGGGGG 
GCACTCAGGG AGCTGGAGAG CCTTCCAGGG AGGAAGTGCA TAGTTCACCC CAAGTCCTTG
AAGCCAGACG TGAAGGGGGA CCTAGAAATC GCGGTAGAGG ACGGAGAGAA AGGGAAGGAC
CTGAGGAACG CGCTTGAGAT AGTGGATAAA CTCCTGGAAC ACGACTTCAC AAGGGGCGAT
TACCTAGTTG CTGTGGGGGG AGGGACAGTA CTCGACGTGG CGGGCTTCTC AGCCTCAATC
TTCATGCGTG GCCTAAACCT AGTTAACGTT CCCACCACCC TCTTGGGGAT GGTAGACGCA
GGGATCGGTG GAAAGACGGG AGTTAATTAC GGTAAGGCAA AGAACATGAT CGGGACCTTT
TATCAACCCT CACTTATCCT AGATGACCTC TCCTTTCTGG ATACTCTCCC CACGGAGGAG
CTGAGAAGGG GACTCGCGGA GGTTGTGAAG TATGCACTGG TCCTAGATAA GGAACTTTAC
GACTTTCTTT CCTTGAATCA CAGCTCAGTC CTGAACAAGG AGGAATCAGC CCTGGAAAAG
GTAATCTCTT CGTCAGTTAG GGACAAGTTA GCGGTCGTTG CGGAGGACGA GAGGGAGACC
AAGGGAGTGA GGATAGTCCT GAACTTCGGG CATACCATAG GGCATGCAAT CGAGGCTGGC
TCAGATTTCA CGGTTCCTCA CGGTCTCGCA ATATCCGTTG GAATGGTATG TGAGGCTAAG
ATTGCCGAGG AAATGGGCTA CGCTGAGGAG GGAGTGGTTG AGGATGTCCT TTGGTTACTG
CAACTCTTTG GGTTACCCAT CTCCTTGGAG CAACTTAACG CGAAAATTGA TGTGGAGAAG
GCGTTAATTG CCATGACAAA GGACAAGAAG AGGAGAGGGG AAGAAGTCCT CCTACCCTTT
CCCACTAGGA TTGGGAATTG GAGGGGGGTA AGGGTGCCAC TTGAGACCCT TGAGGGTTTC
GCTAAGCAAT GCTTGGGAGG TAATTGA
 
Protein sequence
MIEFREKVCC SDVSVKVGRG ALRELESLPG RKCIVHPKSL KPDVKGDLEI AVEDGEKGKD 
LRNALEIVDK LLEHDFTRGD YLVAVGGGTV LDVAGFSASI FMRGLNLVNV PTTLLGMVDA
GIGGKTGVNY GKAKNMIGTF YQPSLILDDL SFLDTLPTEE LRRGLAEVVK YALVLDKELY
DFLSLNHSSV LNKEESALEK VISSSVRDKL AVVAEDERET KGVRIVLNFG HTIGHAIEAG
SDFTVPHGLA ISVGMVCEAK IAEEMGYAEE GVVEDVLWLL QLFGLPISLE QLNAKIDVEK
ALIAMTKDKK RRGEEVLLPF PTRIGNWRGV RVPLETLEGF AKQCLGGN