Gene GSU2291 details


Gene Information

Locus tag: GSU2291
Symbol:
ID: 2686916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2509036
End bp: 2510103
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 60%
IMG OID: 637126984
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: NP_953340
Protein GI: 39997389
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
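
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2509036
END_BP = 2510103
GENE_LENGTH_BP = 1068
PROTEIN_LENGTH_AA = 355


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2510103 - 2509036 + 1 gives 1068 bp, and 1068 / 3 - 1 gives 355 aa, which agrees with the listed gene and protein lengths.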


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCGCA CGAGCAACCT GAAGATCAAA AGTATTACCC CCATCATTGC ACCGGGGGAA 
TTACGCCAGG TGTTTCCCCA GTCGGAGGAG GCGGCCGAGT TCGTCAACTC CAGCCGCGCC
CATATCAAGA ATATCCTCAA GGGCAAGGAT CCGCGCCTGA TGGTGGTGGT GGGCCCCTGT
TCCATTCACG ATCCGAAATC GGCTCTGGAG TATGCGGGGC GGCTGGCACG CCTGGCGGCC
GAACTGTCGG ATCAGCTATT CATCGTGATG CGGGTCTACT TCGAAAAGCC CCGCACTACC
GTAGGCTGGA AGGGACTCAT CAATGACCCC GACATGAACG GCACCCATCA GATATCCAAG
GGGCTCGGCA TCGCCCGGCG GCTGCTGTCC GAAATAACGG AAATGCTCCT GCCGGTGGCA
ACCGAAATGC TTGACCCCAT CACGCCCGAC TACCTCGCGG ATTGCATCTC CTGGGGAGCC
ATCGGCGCTC GTACCACCGA GAGCCAGACC CACCGCGAGA TGGCCAGCGG CCTCTCGTTC
CCCGTGGGAT TCAAGAACGG CACCGACGGC AATCTCCAGA TAGCCATCGA CGCCATGAAT
GCGGCACTCC ATTCCCACAG CTTTCTCGGC GTCAACCGGG AGGGGCGCAC CTCCATCATT
CAGACCACCG GCAACCCCGA TGTCCACATC GTCCTGCGGG GAGGCAAAAA ACCGAACTAT
TTCCCCGAAG ACATCAGAAA GACCGAAGAG ATGCTGGAAA AGGGGGGGCT CTTCCCCACC
ATCATGGTCG ACTGCAGCCA CGGCAACTCG GAAAAACGCC ACGAGAAGCA GCCCGACGTA
CTCTCTTCCG TCGTGGACCA GATTGCGGCC GGCAACCGCT CCATCTCCGG CGTCATGATC
GAGAGTTTTC TGGAAGAAGG GAACCAGTCG ATCCCCAGAG ATCTCTCAAC CCTCAAGTAC
GGCGTATCCA TCACCGACAA GTGCATTGAC TGGAAGACCA CCGAAACCAT CCTGCGCTCG
GCCCACGACC GCCTCAAGGC CGCGGGAGGC AGGCCCCTGC ACGGGTAA
 
Protein sequence
MIRTSNLKIK SITPIIAPGE LRQVFPQSEE AAEFVNSSRA HIKNILKGKD PRLMVVVGPC 
SIHDPKSALE YAGRLARLAA ELSDQLFIVM RVYFEKPRTT VGWKGLINDP DMNGTHQISK
GLGIARRLLS EITEMLLPVA TEMLDPITPD YLADCISWGA IGARTTESQT HREMASGLSF
PVGFKNGTDG NLQIAIDAMN AALHSHSFLG VNREGRTSII QTTGNPDVHI VLRGGKKPNY
FPEDIRKTEE MLEKGGLFPT IMVDCSHGNS EKRHEKQPDV LSSVVDQIAA GNRSISGVMI
ESFLEEGNQS IPRDLSTLKY GVSITDKCID WKTTETILRS AHDRLKAAGG RPLHG
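
The gene and protein sequences above can be compared by translating the nucleotide sequence with translation table 11. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons), and it assumes that the spaces in the listing are display formatting only. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence; stop codons are returned as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence as listed above.
    first_line = (
        "ATGATTCGCA CGAGCAACCT GAAGATCAAA AGTATTACCC CCATCATTGC ACCGGGGGAA"
    )
    print(translate(first_line))  # MIRTSNLKIKSITPIIAPGE
```

Passing the complete gene sequence to `translate` should reproduce the protein sequence followed by a trailing `*` for the stop codon, and `gc_percent` can be used in the same way to compare against the listed GC content.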