Gene Information |
Locus tag | Smed_1570 |
Symbol | |
ID | 5322428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1666800 |
End bp | 1668173 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640790514 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001327246 |
Protein GI | 150396779 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
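
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative, not database fields.

```python
# Values copied from the Gene Information table.
start_bp = 1666800
end_bp = 1668173
gene_length_bp = 1374
protein_length_aa = 457

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 1374 458 457
```

Under these assumptions the span matches the listed gene length of 1374 bp, and 458 codons minus one stop codon gives the listed 457 aa.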
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.301132 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.755317 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAGA CTTGGACTCC GAACAGCTGG CGGCAAAAAC CCATTCAGCA GGTGCCGGAT TATCCGGACC TCGCAGCGCT CGACAGCGTG GAAGCGCGTC TTGCCAAATA TCCGCCGCTT GTCTTTGCAG GAGAGGCGCG CCGTCTGAAG AGTGCGCTTG CCAATGTCGC CGACGGCCGT GGTTTTCTGC TGCAGGGGGG CGATTGCGCC GAGAGCTTTG CCGAACACGG CGCCGATACG ATCCGGGACT TCTTCCGCGC CTTCCTGCAG ATGGCCGTCG TCCTGACCTT CGGGGCGCAG CAACCGGTCG TCAAGGTCGG CCGTATTGCC GGCCAATTCG CCAAGCCGCG CTCCTCGGGT ATCGAGAAGC AGGGTGACGT ATCGCTGCCG AGCTACCGCG GCGACATCAT CAACGGGATC GAGTTCACGG AGGAGGCACG CGTGCCCAAT CCGGAGCGCC AGGTCATGGC CTATCGCCAG TCCGCGGCTA CGCTCAACCT GCTTCGTGCC TTTGCCATGG GTGGTTACGC CAACCTGGAA AACGTCCATC AATGGATGCT CGGTTTCGTC AAGGACAGCC CGCAGGCGGA GCGTTATCGC AAGCTTGCCG ACCGGATCTC CGAGACGATG GATTTCATGA AGGCGATCGG CATTACCGCC GAAAACCATC CGAGCCTGCG CGAAACGGAT TTCTTCACCA GCCACGAGGC GTTGCTGCTT GGGTACGAGC AGGCGCTCAC GCGCGTCGAT TCCACCTCAG GCGACTGGTA CGCGACGTCG GGCCATATGA TCTGGATCGG AGATCGCACG CGCCAGCCGG ATCATGCGCA TATCGAGTAT TGCCGTGGCA TCAAGAATCC GCTCGGCCTG AAGTGCGGTC CGTCGCTGAC GGCGGACGGA CTTCTCGAAC TTATCGACAT CCTCAATCCG GCGAACGAGG CGGGACGGCT TACGCTGATC TGCCGCTTCG GGCACGACAA GGTCGCCGAG CATCTGCCGC GCCTCATCCG CGCCGTCGAG CGGGAGGGCA AGAAGGTGGT GTGGTCCTGC GATCCGATGC ACGGCAACAC GATCACGCTC AACAATTACA AGACCCGGCC GTTCGAGCGT ATCCTGTCGG AAGTCGAGAG CTTCTTTCAG ATTCATCGCG CCGAGGGCTC GCATCCCGGC GGCATCCATA TCGAAATGAC CGGCAACGAC GTGACGGAAT GCACCGGCGG TGCACGCGCG CTTTCCGGCG ACGACCTTGC CGACCGCTAC CACACGCATT GCGATCCGCG CCTCAATGCC GATCAGGCAC TCGAGCTTGC CTTCCTGCTC GCCGAGCGCA TGAAGGGCGG CCGCGACGAG AAGAAGATGG TGGTGAACGG CTGA
|
Protein sequence | MAQTWTPNSW RQKPIQQVPD YPDLAALDSV EARLAKYPPL VFAGEARRLK SALANVADGR GFLLQGGDCA ESFAEHGADT IRDFFRAFLQ MAVVLTFGAQ QPVVKVGRIA GQFAKPRSSG IEKQGDVSLP SYRGDIINGI EFTEEARVPN PERQVMAYRQ SAATLNLLRA FAMGGYANLE NVHQWMLGFV KDSPQAERYR KLADRISETM DFMKAIGITA ENHPSLRETD FFTSHEALLL GYEQALTRVD STSGDWYATS GHMIWIGDRT RQPDHAHIEY CRGIKNPLGL KCGPSLTADG LLELIDILNP ANEAGRLTLI CRFGHDKVAE HLPRLIRAVE REGKKVVWSC DPMHGNTITL NNYKTRPFER ILSEVESFFQ IHRAEGSHPG GIHIEMTGND VTECTGGARA LSGDDLADRY HTHCDPRLNA DQALELAFLL AERMKGGRDE KKMVVNG
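
The relationship between the gene sequence and the protein sequence can be sketched in plain Python. This is a minimal illustration, not the database's own pipeline: `translate` and `gc_percent` are hypothetical helper names, whitespace between the 10-character groups is stripped before processing, and the codon-to-amino-acid assignments are those of the standard code, which translation table 11 shares (table 11 differs in which codons may act as starts). An alternative start codon would therefore be rendered as its ordinary amino acid by this sketch, not as M.

```python
# Codon table in the conventional TCAG ordering (standard assignments, shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Remove the spaces used to group the sequence and upper-case it."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence listed above.
print(translate("ATGGCACAGA CTTGGACTCC GAACAGCTGG"))  # prints: MAQTWTPNSW
# Last twelve bases of the gene sequence, ending in the TGA stop codon.
print(translate("GTGAACGG CTGA"))  # prints: VNG
```

The two printed fragments correspond to the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the listed GC content to be checked in the same way.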