Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1517 |
Symbol | |
ID | 8136846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1772254 |
End bp | 1773606 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644869129 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_003021331 |
Protein GI | 253700142 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.00684484 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTAGCC AGAAATGGAC CAAATCGAGC TGGCGTTCTT TTCGCGCCTT GCAGCAACCG GTATGGCCGG CGGGATCGGC GCTGGAGGAG ACGCAGAAGA CGCTGTCCCA GCTCCCGCCG CTGGTGTTCG CAGGCGAGTG CCAGACCCTG AAGGCGCAGT TGGCCGACGC CGTGGAAGGC CGCGCTTTCG TGCTGCAATG CGGCGACTGC GCCGAGGATT TTTCGCGCTG CACCGGCCCC GACATCAGGG AGCTCTTGAA GGTGATCCTG CAGATGTCGG TGGTGCTCGC CTTTGCCGGC GAGAAGCGGG TGATCAAGAT AGGGAGGATC GCCGGGCAGT ACGCGAAACC GCGCTCGTCC GACACCGAGA TGGTGCACGG GATCGAGCTC CCCAGCTACC GCGGCGACAT GGTGAACAGT CCCGAGCCGA CCCTGGAGGC GAGGACCCCC GACCCGCGCC GAATGCTGGA GGGGTATTAC CGGGCAGCGG CGACGCTGAA CCTGGTTCGC TCCTTCACGC TGGGAGGATA CGCCTCGCTG GAGCGGGTGC AGGCCTGGCA CCGCGCCTCG CTCGATGCCC TTCCGGCCGG GCAGAAGTAC GAGGATCTGG TCCGGCAGAT CTGGAAGACC ATCAACTTCA TGACGGCCAT AGGCCTCGAC CCTCAGCACA CCCCGCAGTT GAACCAGGTG ACGCTCTACA CCTCGCACGA GGCCCTCTTG CTCGACTACG AGGAGGCGCT CACCCGTATG GATTCGACCA GCGGGGGGTG GTACGACTGC AGCGCGCACA TGCTCTGGAT CGGCGACCGG ACGAGACAGC TGGACGGGGC GCATGTCGAG TTCCTGCGCG GGGTGAAAAA CCCGCTGGGG ATGAAGGTGG GGCCAAGCTA CGACATCGAC ACGGTGAAAG CGCTGGCGCA GCGGTTGAAC CCCGACAACG AGCCGGGGCG GCTCACTCTG ATCACCCGCT TCGGCGCCGA CAAGATCGAT TCCTACCTCC CCAGGCTTTT GAAAGAGATG AAGCAGGAGG GGTTCAAGGT GGTCTGGAGC TGCGACCCCA TGCACGGCAA CACCTACCAG AACGAGTATG GCCAGAAATC CAGGAAGTTC GAGGACATCC TGCGGGAAAT CAAGAATTTC CAGCAGATAC ACAAGGCGGA GGGGACCGTT GCCGGAGGGG TGCACCTGGA GCTCACCGGC GACCACGTAA CCGAGTGCAC CGGGGGGAGC CGGCAGCTTT TGGACAAGCA TCTGCACCTG AACTACCAGA CCAACTGCGA TCCGCGCCTG AACGCCGAGC AGAGCGTGGA GCTCGCCTTC GAGCTTGCCG AGATGCTGCA TCCCTGCAAG TGA
|
Protein sequence | MPSQKWTKSS WRSFRALQQP VWPAGSALEE TQKTLSQLPP LVFAGECQTL KAQLADAVEG RAFVLQCGDC AEDFSRCTGP DIRELLKVIL QMSVVLAFAG EKRVIKIGRI AGQYAKPRSS DTEMVHGIEL PSYRGDMVNS PEPTLEARTP DPRRMLEGYY RAAATLNLVR SFTLGGYASL ERVQAWHRAS LDALPAGQKY EDLVRQIWKT INFMTAIGLD PQHTPQLNQV TLYTSHEALL LDYEEALTRM DSTSGGWYDC SAHMLWIGDR TRQLDGAHVE FLRGVKNPLG MKVGPSYDID TVKALAQRLN PDNEPGRLTL ITRFGADKID SYLPRLLKEM KQEGFKVVWS CDPMHGNTYQ NEYGQKSRKF EDILREIKNF QQIHKAEGTV AGGVHLELTG DHVTECTGGS RQLLDKHLHL NYQTNCDPRL NAEQSVELAF ELAEMLHPCK
|
| |