Gene GSU0437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0437 
SymbolubiD 
ID2687261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp469239 
End bp470654 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content69% 
IMG OID637125103 
Product3-octaprenyl-4-hydroxybenzoate carboxy-lyase 
Protein accessionNP_951496 
Protein GI39995545 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAACG GTGATCTTCA CGATTTTCTT GCCGAGTTGG AACGGCTCGG TGACCTTCAT 
CGGGTGGCGG CTGAAGTCGA CCCGGTGCTT GAGGTGGCCG CCATCACCGA CCGGGTGAGC
AAGCTGCCGG GCGGGGGGAA GGGGCTTCTT TTCGAGCGCG TGAAGGGCTC CCGCTTCCCA
CTGGTCACCA ACGTCTTCGG CTCGGCCGGA CGGGTGGCAG CCGCCCTTGG CGCGGCAACC
CTCGACGACT TGACGGCGCG GATGGAGTCG CTCCTCGCCG CCGTGCCCCC CGCGAAAACG
GCATCCCCCC TGGAGGCGAT GGCCTCCCTG GCCGAGATGC GGCGCTTTGC CCCCATGATC
GTGGAGAAGG CGCCCTGCCA GGAGGTGGTG GAGGCGCCGG ACCTCCTGCG CTACCCGTTT
CCGCACTCCT GGCCCGGCGA CGGTGGCCGC TTCATCACTC TTCCCCTTGT TGTTACCCGT
GACCCGGAGA CGAACACTCC CAACTGCGGC ATGTACCGCG TGCGGGTGGT GGATGGAGCC
AGCGCCGGCA TCCGGTGGTA TGCGGGGAAG GGGGGCGAGC TCCACTGTCG GCGCCATCGG
GAGCGCGGGG AGCGGATGCC CGTGGCCGTG GCCATCGGCG GCGATCCGGC GGCAATCCTG
GCGGCGATCC TTCCCCTTCC CGAGGGGTTC GACGAAATGC TCTTTGCTGG TTTTCTCCGG
GGGCGCCCCC TGGAGCTGGC GCGCTGCCGG ACCAGCGACC TGCTGGTCCC GGCCGGGGCG
GAACTGGTTC TGGAAGGCTA TGTGGAGCCC GGCGAGACGG TCATGGACGG CGCCTTCGGC
AACCATACGG GCTTTTACGC CCCGGCGGCA TCGGTGCCCC TCATGCGGCT CACCTGCATA
ACGCGCCGCG CCGATTGCCT CTGCCCGGCC ACGGTGGTGG GCCGGCCCCC CATGGAGGAC
TGCTATCTGG CCAAGGCGGC GGAGCGGCTC CTGCTGCCGG TGCTCAGGAT GCGCTGGCCC
GAGATCGTCG ACATCAACTA TCCGCTGGAA TGGATATTCA ACGGAGGCGC TGTTCTTTCG
CTGCGGGAAT GCTCTTCCTC CCGGGTCCGC CGGATCGTCA CGGAACTTTG GAGTTCGGGG
CTTGCCGGGC CCGGCAGGCT TCTGGTGGCG GTGGACGAGG GGACGCGGGT CGATGATCCC
GCCGACGTGG CTTGGCGGGC CATGAACGCG GTGGACTGGC GGACCGATCT GATCATTGCC
GAGCGTGACG CCGCAGCCGC CTGGCCCGGC CTCGGTTCGC GGCTGGCCAT CGACGCCACC
CGATCCGCTG CCCGCCGCTA CGGAGCAGAA GAACTCGTGC CGGACCGGGA GACGGCCCGG
CGTGTGGACT CGCGTTGGAG GGAATACGGA TTCTGA
 
Protein sequence
MANGDLHDFL AELERLGDLH RVAAEVDPVL EVAAITDRVS KLPGGGKGLL FERVKGSRFP 
LVTNVFGSAG RVAAALGAAT LDDLTARMES LLAAVPPAKT ASPLEAMASL AEMRRFAPMI
VEKAPCQEVV EAPDLLRYPF PHSWPGDGGR FITLPLVVTR DPETNTPNCG MYRVRVVDGA
SAGIRWYAGK GGELHCRRHR ERGERMPVAV AIGGDPAAIL AAILPLPEGF DEMLFAGFLR
GRPLELARCR TSDLLVPAGA ELVLEGYVEP GETVMDGAFG NHTGFYAPAA SVPLMRLTCI
TRRADCLCPA TVVGRPPMED CYLAKAAERL LLPVLRMRWP EIVDINYPLE WIFNGGAVLS
LRECSSSRVR RIVTELWSSG LAGPGRLLVA VDEGTRVDDP ADVAWRAMNA VDWRTDLIIA
ERDAAAAWPG LGSRLAIDAT RSAARRYGAE ELVPDRETAR RVDSRWREYG F