Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1875 |
Symbol | aroB |
ID | 5104143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1818086 |
End bp | 1819132 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640507761 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001191939 |
Protein GI | 146304623 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.722069 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAGT TCCGTGAGAA GGTGTGCTGT TCAGATGTGA GCGTGAAAGT TGGTAGGGGG GCACTCAGGG AGCTGGAGAG CCTTCCAGGG AGGAAGTGCA TAGTTCACCC CAAGTCCTTG AAGCCAGACG TGAAGGGGGA CCTAGAAATC GCGGTAGAGG ACGGAGAGAA AGGGAAGGAC CTGAGGAACG CGCTTGAGAT AGTGGATAAA CTCCTGGAAC ACGACTTCAC AAGGGGCGAT TACCTAGTTG CTGTGGGGGG AGGGACAGTA CTCGACGTGG CGGGCTTCTC AGCCTCAATC TTCATGCGTG GCCTAAACCT AGTTAACGTT CCCACCACCC TCTTGGGGAT GGTAGACGCA GGGATCGGTG GAAAGACGGG AGTTAATTAC GGTAAGGCAA AGAACATGAT CGGGACCTTT TATCAACCCT CACTTATCCT AGATGACCTC TCCTTTCTGG ATACTCTCCC CACGGAGGAG CTGAGAAGGG GACTCGCGGA GGTTGTGAAG TATGCACTGG TCCTAGATAA GGAACTTTAC GACTTTCTTT CCTTGAATCA CAGCTCAGTC CTGAACAAGG AGGAATCAGC CCTGGAAAAG GTAATCTCTT CGTCAGTTAG GGACAAGTTA GCGGTCGTTG CGGAGGACGA GAGGGAGACC AAGGGAGTGA GGATAGTCCT GAACTTCGGG CATACCATAG GGCATGCAAT CGAGGCTGGC TCAGATTTCA CGGTTCCTCA CGGTCTCGCA ATATCCGTTG GAATGGTATG TGAGGCTAAG ATTGCCGAGG AAATGGGCTA CGCTGAGGAG GGAGTGGTTG AGGATGTCCT TTGGTTACTG CAACTCTTTG GGTTACCCAT CTCCTTGGAG CAACTTAACG CGAAAATTGA TGTGGAGAAG GCGTTAATTG CCATGACAAA GGACAAGAAG AGGAGAGGGG AAGAAGTCCT CCTACCCTTT CCCACTAGGA TTGGGAATTG GAGGGGGGTA AGGGTGCCAC TTGAGACCCT TGAGGGTTTC GCTAAGCAAT GCTTGGGAGG TAATTGA
|
Protein sequence | MIEFREKVCC SDVSVKVGRG ALRELESLPG RKCIVHPKSL KPDVKGDLEI AVEDGEKGKD LRNALEIVDK LLEHDFTRGD YLVAVGGGTV LDVAGFSASI FMRGLNLVNV PTTLLGMVDA GIGGKTGVNY GKAKNMIGTF YQPSLILDDL SFLDTLPTEE LRRGLAEVVK YALVLDKELY DFLSLNHSSV LNKEESALEK VISSSVRDKL AVVAEDERET KGVRIVLNFG HTIGHAIEAG SDFTVPHGLA ISVGMVCEAK IAEEMGYAEE GVVEDVLWLL QLFGLPISLE QLNAKIDVEK ALIAMTKDKK RRGEEVLLPF PTRIGNWRGV RVPLETLEGF AKQCLGGN
|
| |