Gene Information |
Locus tag | GSU3142 |
Symbol | |
ID | 2688427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 3449646 |
End bp | 3450665 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637127835 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | NP_954183 |
Protein GI | 39998232 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
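The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon; the record does not state either convention. The function names are hypothetical, chosen for illustration.

```python
# Values copied from the record above.
START_BP = 3449646
END_BP = 3450665
GENE_LENGTH_BP = 1020
PROTEIN_LENGTH_AA = 339


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, ASSUMING a complete, unspliced CDS
    whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length, and protein length are consistent")
```

Passing the Gene sequence string from the Sequence section to `gc_percent` gives a value that can be compared with the GC content field; the grouping spaces in that string are ignored by the function.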
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATCATCG TCATGAAAGC GGGAGCGGCA AAAAAAGACC GGGACGAGGT AATCAGGCGG ATCAGGGAGC TGGGCTACAA GCCCCATGTC ATCCATGGCA CCACGCGGGA CGTGATCGGG GCCGTGGGGG ACGAGCGGGG CAAGTTGGTC CTTCAGGGCG TCGAATCCAT GCACGGCGTG GAGAGCGTGG TGCCGATCCT GAAGCCGTAC AAGCTGGCAT CAAGGGAGGT GAAGCCCGAA CCGAGTGTTG TCGCCATTAC GGATACCGTG GTCATCGGCG GCCCTGAAGT CATTGTCATG GCCGGTCCCT GTTCGGTCGA GGGAGAGGCC ATGATTATCG AGACGGCCAG GGCGGTAAAG GCGGCCGGCG CCCAGGTGCT TCGGGGCGGA GCGTTCAAGC CGCGCACCTC TCCCTACTCC TTCCAGGGAC TCGAAGAGGA AGGGCTCAAG CTGCTGGCCA AGGCGCGCGA GGAGACGGGT CTCCCCATCG TGACCGAGGT GGTCAATCCC GAGACGGCCG AACTGGTGTC CGAGTACTCC GACATTCTCC AGATCGGAGC CCGCAATGCC CAGAACTTTG CCCTGCTCAA GAAGGTGGGG CAGCTTCGGC GCCCGGTGCT CCTCAAGCGG GGCATGTCCA TGACCATCCA GGAGTTCCTC ATGAGCGCCG AGTATGTCAT GAGTGAAGGG AATCAGTCGG TTATCCTCTG CGAACGCGGT ATCCGCACCT TCGAGACCGC CACCCGCAAC ACGCTGGATC TCTCGGCCAT TCCGGTGTTG AAGCAGATGA CCCATCTGCC GGTCATTGCC GATCCTTCCC ACGGCACCGG AAACTATCAC TATGTGGCAC CCATGGCCCT GGCAGCTGTT GCCGCCGGCG CGGACGGGCT CATGATCGAG GTGCATCCCG ATCCCGAGCG TGCGTCGTCC GATGGGCCCC AGTCGCTCAA GCCGAAGAAA TTCGACGCGC TTATGGCAAA GTTGCGGCTC GTGGCGGCGT CCGTGGACAG GCGACTGTAA
Protein sequence | MIIVMKAGAA KKDRDEVIRR IRELGYKPHV IHGTTRDVIG AVGDERGKLV LQGVESMHGV ESVVPILKPY KLASREVKPE PSVVAITDTV VIGGPEVIVM AGPCSVEGEA MIIETARAVK AAGAQVLRGG AFKPRTSPYS FQGLEEEGLK LLAKAREETG LPIVTEVVNP ETAELVSEYS DILQIGARNA QNFALLKKVG QLRRPVLLKR GMSMTIQEFL MSAEYVMSEG NQSVILCERG IRTFETATRN TLDLSAIPVL KQMTHLPVIA DPSHGTGNYH YVAPMALAAV AAGADGLMIE VHPDPERASS DGPQSLKPKK FDALMAKLRL VAASVDRRL