Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU3333 |
Symbol | |
ID | 2687649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 3662036 |
End bp | 3663046 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637128027 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | NP_954373 |
Protein GI | 161579496 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0338957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGATTG TCATGAACCA CAAGGCTGGA CCGAAACAAA TCGAGGCGGT AGTGAAGGCG GTGGAGCAGA TGGGACTTAC GGCGGCGCCC ATTCCCGGCA GCGAACGGAC AGCCATCGGC GTCCTCGGCA ATCACGGGTA TGTGGATGAT ACCACCATCC GGGATCTGCC CGGCGTTCAG GAGGTCATCC ATGTCTCCAA ACCCTATAAG CTCGTTTCCC GCGACTTCCA CCCGCGCCAT ACGGTGGTGA AGGTGGGCGA CGTGGCCATC GGCGAGGGGA AGCGCCCTGT AGTGGTGGCC GGCCCCTGCG CCGTGGAAGG GGAGGAGCAG ATCGTCCGGA CCGCGCGGGC GGTGAAAAAA TACGGAGCTG ATCTTCTGCG GGGGGGCGCC TTCAAACCCC GCACCGGTCC CCATACCTTC CAGGGGCTGC GGGAGGAAGG GCTGAAGCTT CTGGCCATTG CCCGCCGGGA GACGGGACTT CCCATCGTGA CCGAGGTCAT GAGTCCCGAC ACGGTGGGAC TTGTGGCGGA ATATGCCGAC CTCCTCCAGG TTGGCGCGCG CAACATGCAA AACTTCGAAC TGCTCAAGGA GTTGGGCCGA ATCCGCAAGC CAGTGCTCCT CAAGCGGGGG ATGAGTGCTA CTCTGGAGGA ATTTCTGGCC GCGGCCGAAT ACATTTTGGC TGAGGGCAAC GGCCAGGTGA TCCTCTGCGA GCGGGGGATC CGGACCTTCG AGACCGCCAC CCGCAATACC CTCGACCTGG CGGTGGTGCC CCTCATCCGG GAGATGACCC ATCTGCCGGT CATGGTTGAC CCCTCCCACG CCACCGGAAA GCGGAGCCTC GTGGCGCCCA TGGCCAAGGC GGCGCTGGTG GCAGGAGCCC ACGGTGTCCT CGTGGAGGTC CACCCGGAGC CGGACAAGGC CCTCTCGGAC GGCCCCCAGT CTCTCACTTT CCACGGCTTC GAGGCACTCA TGGGCGAGAT CCGGCGGCTC AACGAGTTCC TCGGCTTCTG A
|
Protein sequence | MLIVMNHKAG PKQIEAVVKA VEQMGLTAAP IPGSERTAIG VLGNHGYVDD TTIRDLPGVQ EVIHVSKPYK LVSRDFHPRH TVVKVGDVAI GEGKRPVVVA GPCAVEGEEQ IVRTARAVKK YGADLLRGGA FKPRTGPHTF QGLREEGLKL LAIARRETGL PIVTEVMSPD TVGLVAEYAD LLQVGARNMQ NFELLKELGR IRKPVLLKRG MSATLEEFLA AAEYILAEGN GQVILCERGI RTFETATRNT LDLAVVPLIR EMTHLPVMVD PSHATGKRSL VAPMAKAALV AGAHGVLVEV HPEPDKALSD GPQSLTFHGF EALMGEIRRL NEFLGF
|
| |