Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_4302 |
Symbol | |
ID | 5166809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 4969645 |
End bp | 4970673 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640551781 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_001233018 |
Protein GI | 148266312 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000170599 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTATCG TCATGAGCTA CAATGCCGGA GAGGAAGAGA TCGATGCCGT CGTCAAGGCG GTTGAGGAGA TGGGCTATAA GGCCAAGCCG ATTCCCGGCG GTGAACGGAC CGCTATCGGC GTTCTGGGGA ATACCGGCTA TGTGGACGAT GTGATCATCA GGGACCTGCC CGGTGTGCAG GAGGTTATCC ATGTCTCCAA ACCCTACAAG CTTGTTTCCC GGGCTTTTCA CCCGCAGAGC AGCATCATAA ATGTTTGCGG GGTGGAAATC GGCGAGGGGT GCCGACCGGT TGTCGCCGCT GGTCCATGCG CGGTGGAAAG TGAAGAGCAG ATCGTCAAAA CCGCCCTGGC GGTCAAGGCG GCCGGCGCCG ATCTGTTACG CGGCGGCGCT TTCAAGCCGA GAACCGGCCC CCATACGTTT CAGGGGTTAA GAGAAGAAGG TTTGCGACTG TTGGCCCTGG CCGGCAAAGA GAGTGGCCTC CCCATCGTCA CTGAGGTGAT GAGCCCGGAA AGTGTCGGGA TTGTGGCGGA ATACGCCGAC CTCCTCCAGG TAGGTGCGCG TAATATGCAG AACTTTGACC TGTTGCGGGA GGTGGGCCGT ATCGAGAAAC CGGTCCTCCT CAAGCGGGGG ATGAGCGCTA CCATCGAAGA GTTTCTTGCT GCCGCGGAAT ACATCCTGGC GGAGGGGAAT CCCAACGTCA TCCTTTGCGA GCGCGGCATT CGCACCTTCG AAACCGCTAC CCGCAATACC CTCGACCTTT CGGTGGTGCC GCTCATCAAG GAATTGTCCC ATCTGCCCAT CATGGTTGAT CCCTCCCATG CCACCGGCAA ACGAAGCCTT GTCCCTCCCA TGTCGAAAGC CGCCCTGGTA GCGGGAGCCC ACGGCATTCT CGTTGAGGTT CATCCGGAAC CGGAGAAAGC GCTTTCCGAT GGTCCGCAAT CTTTGACTTT CCAGGGCTTT GACAAGCTGA TGGAGGAGGT AAGAAAGCTT AACCAGTTCC TTGGCTACGG CGCTGAAAAA GACGCTTGA
|
Protein sequence | MLIVMSYNAG EEEIDAVVKA VEEMGYKAKP IPGGERTAIG VLGNTGYVDD VIIRDLPGVQ EVIHVSKPYK LVSRAFHPQS SIINVCGVEI GEGCRPVVAA GPCAVESEEQ IVKTALAVKA AGADLLRGGA FKPRTGPHTF QGLREEGLRL LALAGKESGL PIVTEVMSPE SVGIVAEYAD LLQVGARNMQ NFDLLREVGR IEKPVLLKRG MSATIEEFLA AAEYILAEGN PNVILCERGI RTFETATRNT LDLSVVPLIK ELSHLPIMVD PSHATGKRSL VPPMSKAALV AGAHGILVEV HPEPEKALSD GPQSLTFQGF DKLMEEVRKL NQFLGYGAEK DA
|
| |