Gene Information |
Locus tag | GM21_3615 |
Symbol | |
ID | 8138988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4195208 |
End bp | 4196233 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644871235 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_003023394 |
Protein GI | 253702205 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
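The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record: Start bp 4195208, End bp 4196233.
length = gene_length_bp(4195208, 4196233)
print(length, protein_length_aa(length))  # 1026 341
```

Under these assumptions the result agrees with the Gene Length (1026 bp) and Protein Length (341 aa) fields of the record.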
|
|
Plasmid Coverage Information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 127 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGTAG TCATGAAAGC AGGAGCGGCT AAAAAAGAAC GCGACGCGGT GACCAAGAGG ATCAAGGAAC TGGGGTACAC CCCGCACATC ATCCATGGCA CCACCCGCGA CGTGATCGGC GCCGTCGGGG ACGAGCGTGG CAAGAGCGTC TTGCAGACGC TGGAGTCGAT GCAGGGGGTC GAGAGCGTCG TCCCCATCCT GCAGCCCTAC AAGCTCGCCT CCAAGGAAGT GAAGAAAGAG GCCAGCATCG TCCGCATCTC CGACACCCTC GCCATCGGCG GGAAAGAACT GGTGGTGATG GCTGGGCCCT GCTCCGTCGA GAACGAGGAG CAGATCATCG AGTCGGCCCT CGCCGTCAAG GCGGCCGGCG CCCAGATGCT GCGCGGCGGC GCCTTCAAGC CGCGTACCTC CCCCTACTCC TTCCAGGGAC TGGAGGAAGA GGGGCTGAAG CTTTTGGCGA AGGCCCGTGA GATAACCGGC CTCCCCTTCG TCACCGAGAT CATCGATCCC GATTCCGTCG ACCTCGTCGC CTCCTACTCA GACATGCTCC AGATCGGGGC GCGCAACTCC CAGAACTTCG CCCTCCTGAA GAAAGTCGGG CAGATCGACA AGCCGATCCT TTTGAAGCGC GGCATGTCGA TGACCATCCA GGAGTTCCTG ATGAGCGCCG AGTACATCAT GAGCGAGGGG AACCAGTCCG TCATCCTCTG CGAGCGCGGC ATCAGGACCT TCGAGACCGC CACCAGGAAC ACCCTCGATC TCTCCGCCAT TCCCGTTTTG AAGTCGAAGA CGCATCTTCC CATCGTCATC GACCCCTCCC ACGGCACCGG CAACTACCAC TACATAGCGC CCATGTCCTT CGCCGCGGTT GCCGCCGGCG CCGACGGTCT CATCATCGAA GTGCACCCCG ATCCGGAGAA GGCCTCCTCC GACGGGCCGC AGTCGCTCAA ACCTAAGAAG TTCCAGGCAT TGATGGACAA GCTGCGGCTC TTCGCGGAAG CCGCGGACAA GACACTCGGC GCGTAG
|
Protein sequence | MIVVMKAGAA KKERDAVTKR IKELGYTPHI IHGTTRDVIG AVGDERGKSV LQTLESMQGV ESVVPILQPY KLASKEVKKE ASIVRISDTL AIGGKELVVM AGPCSVENEE QIIESALAVK AAGAQMLRGG AFKPRTSPYS FQGLEEEGLK LLAKAREITG LPFVTEIIDP DSVDLVASYS DMLQIGARNS QNFALLKKVG QIDKPILLKR GMSMTIQEFL MSAEYIMSEG NQSVILCERG IRTFETATRN TLDLSAIPVL KSKTHLPIVI DPSHGTGNYH YIAPMSFAAV AAGADGLIIE VHPDPEKASS DGPQSLKPKK FQALMDKLRL FAEAADKTLG A
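The sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may initiate translation. The helper names (`ungroup`, `gc_fraction`, `translate`) are hypothetical, and the code assumes an unambiguous A/C/G/T sequence that starts with ATG; alternative start codons and ambiguity codes such as N are not handled.

```python
# Codon table built from the conventional TCAG ordering (standard assignments,
# which translation table 11 shares).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the display spacing between 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = ungroup(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = ungroup(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGATCGTAG TCATGAAAGC AGGAGCGGCT"
print(translate(prefix))    # MIVVMKAGAA
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first block of the protein sequence in the record. The 0.5 figure applies only to these 30 bases; the record reports 62% GC for the full gene.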