Gene GSU3333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3333 
Symbol 
ID2687649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3662036 
End bp3663046 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content64% 
IMG OID637128027 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionNP_954373 
Protein GI161579496 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0338957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGATTG TCATGAACCA CAAGGCTGGA CCGAAACAAA TCGAGGCGGT AGTGAAGGCG 
GTGGAGCAGA TGGGACTTAC GGCGGCGCCC ATTCCCGGCA GCGAACGGAC AGCCATCGGC
GTCCTCGGCA ATCACGGGTA TGTGGATGAT ACCACCATCC GGGATCTGCC CGGCGTTCAG
GAGGTCATCC ATGTCTCCAA ACCCTATAAG CTCGTTTCCC GCGACTTCCA CCCGCGCCAT
ACGGTGGTGA AGGTGGGCGA CGTGGCCATC GGCGAGGGGA AGCGCCCTGT AGTGGTGGCC
GGCCCCTGCG CCGTGGAAGG GGAGGAGCAG ATCGTCCGGA CCGCGCGGGC GGTGAAAAAA
TACGGAGCTG ATCTTCTGCG GGGGGGCGCC TTCAAACCCC GCACCGGTCC CCATACCTTC
CAGGGGCTGC GGGAGGAAGG GCTGAAGCTT CTGGCCATTG CCCGCCGGGA GACGGGACTT
CCCATCGTGA CCGAGGTCAT GAGTCCCGAC ACGGTGGGAC TTGTGGCGGA ATATGCCGAC
CTCCTCCAGG TTGGCGCGCG CAACATGCAA AACTTCGAAC TGCTCAAGGA GTTGGGCCGA
ATCCGCAAGC CAGTGCTCCT CAAGCGGGGG ATGAGTGCTA CTCTGGAGGA ATTTCTGGCC
GCGGCCGAAT ACATTTTGGC TGAGGGCAAC GGCCAGGTGA TCCTCTGCGA GCGGGGGATC
CGGACCTTCG AGACCGCCAC CCGCAATACC CTCGACCTGG CGGTGGTGCC CCTCATCCGG
GAGATGACCC ATCTGCCGGT CATGGTTGAC CCCTCCCACG CCACCGGAAA GCGGAGCCTC
GTGGCGCCCA TGGCCAAGGC GGCGCTGGTG GCAGGAGCCC ACGGTGTCCT CGTGGAGGTC
CACCCGGAGC CGGACAAGGC CCTCTCGGAC GGCCCCCAGT CTCTCACTTT CCACGGCTTC
GAGGCACTCA TGGGCGAGAT CCGGCGGCTC AACGAGTTCC TCGGCTTCTG A
 
Protein sequence
MLIVMNHKAG PKQIEAVVKA VEQMGLTAAP IPGSERTAIG VLGNHGYVDD TTIRDLPGVQ 
EVIHVSKPYK LVSRDFHPRH TVVKVGDVAI GEGKRPVVVA GPCAVEGEEQ IVRTARAVKK
YGADLLRGGA FKPRTGPHTF QGLREEGLKL LAIARRETGL PIVTEVMSPD TVGLVAEYAD
LLQVGARNMQ NFELLKELGR IRKPVLLKRG MSATLEEFLA AAEYILAEGN GQVILCERGI
RTFETATRNT LDLAVVPLIR EMTHLPVMVD PSHATGKRSL VAPMAKAALV AGAHGVLVEV
HPEPDKALSD GPQSLTFHGF EALMGEIRRL NEFLGF