Gene GSU3453 details


Gene Information

Locus tag: GSU3453
Symbol: hemE
ID: 2688126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3800181
End bp: 3801203
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 62%
IMG OID: 637128148
Product: uroporphyrinogen decarboxylase
Protein accession: NP_954493
Protein GI: 39998542
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01463] methyltransferase, MtaA/CmuA family; [TIGR01464] uroporphyrinogen decarboxylase
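
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 3800181
end_bp = 3801203
gene_length_bp = 1023
protein_length_aa = 340

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons. The final codon is
# assumed to be a stop codon, which adds no residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, all three checks agree with the listed values: 1023 bp corresponds to 341 codons, that is 340 residues plus a stop codon.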


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0575742
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAATC GTTTTCTCGA CGCCTGCTGG GGCAAGCCCG TTGACCGTAC TCCCGTGTGG 
CTCATGCGCC AGGCGGGCCG CTATCTCCCC GAATATATGG CAGTACGCTC CAAGTGCACC
TTCCTGGAAC TCTGCAAGAC CCCGGAACTG GCGGCAGAGG TGACCATTCA GCCCATCGAC
ATTCTGAATG TCGACGCGGC GATCCTCTTC TCCGACATCC TCACTCCGGT TGAGCCCATG
GGGCTCAAGC TCGACTTCGT GCCCGGCCCG GTCTTCGAGC ACCCCGTACG TACCATGGCC
GATGTGGAGA AGCTGCGCAT TCCCAACCCC GAGGAAGATG TCCCCTACGT ACTGGACACC
ATCAAGATCC TCCGTCGGGA ACTGGCCGGC AGGGTTCCCC TGATCGGCTT CGGCGGAGCG
CCGTTCACCC TGGCCTGCTA CATGGTTGAA GGCAAGGGTT CCAAGGACTG GGCAAACATC
AAGCGGATGA TGTATGCCGC TCCCGACGTC TATGCCGCCC TCATGGACAA GGTTACCATG
ATGGACATGG AGTACCTGAA CGCCCAGATC AAGGCCGGTG CCCAGGCGAT CCAGATCTTC
GACACCTGGG GCGGGGTCCT CTCCCCCACC GATTACGAGA AGTACGTTCT GCCCTACACC
ACCAAGCTCA TCAACGGCCT GAACCGTCAG AACACGCCGG TGATCCACTT TGTCAAGGGC
GCCGGCACCA TGCTGGAGAC GGTGCAGAAG GCCGGCGGCG ACGTCATGGG GCTCGACTGG
CACGTGAATC TTGGGAAGGC CAGGGACGTG CTCGGTCAGA ACATGGCCGT GCAGGGGAAC
CTGGACCCCA CGGTCCTCTA CGCCCCCAAA GAGGTCATCG AGGCCGAGGT GAAGCGGGTG
CTCGACGAGA ATGCCGGCCG TCCCGGACAC ATCTTCAACC TGGGACACGG CATCCTGCCG
ACAGTGCCGC CGGAAAACGC CATCCACATG GTGGAGTGCG TGCACCGGCT GTCCCAGAAG
TAG
 
Protein sequence
MNNRFLDACW GKPVDRTPVW LMRQAGRYLP EYMAVRSKCT FLELCKTPEL AAEVTIQPID 
ILNVDAAILF SDILTPVEPM GLKLDFVPGP VFEHPVRTMA DVEKLRIPNP EEDVPYVLDT
IKILRRELAG RVPLIGFGGA PFTLACYMVE GKGSKDWANI KRMMYAAPDV YAALMDKVTM
MDMEYLNAQI KAGAQAIQIF DTWGGVLSPT DYEKYVLPYT TKLINGLNRQ NTPVIHFVKG
AGTMLETVQK AGGDVMGLDW HVNLGKARDV LGQNMAVQGN LDPTVLYAPK EVIEAEVKRV
LDENAGRPGH IFNLGHGILP TVPPENAIHM VECVHRLSQK
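
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes translation table 11 as listed in the record; that table assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as starts, so a plain codon lookup is enough here. The helper `translate` and the constant names are hypothetical, written for this example only, and only the first line of each sequence is used.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and
# translation table 11, in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence and the matching start of the protein,
# copied from the record above.
gene_start = "ATGAATAATC GTTTTCTCGA CGCCTGCTGG GGCAAGCCCG TTGACCGTAC TCCCGTGTGG"
protein_start = "MNNRFLDACW GKPVDRTPVW"

translated = translate(gene_start)
print(translated)
print("matches record:", translated == protein_start.replace(" ", ""))
print("final codon TAG translates to:", translate("TAG"))
```

The same function can be applied to the full gene sequence; the trailing `*` produced by the final TAG codon is not part of the 340-residue protein.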