Gene GSU0012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0012 
SymbolhemG 
ID2685243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp20296 
End bp21705 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content65% 
IMG OID637124674 
Productprotoporphyrinogen oxidase 
Protein accessionNP_951074 
Protein GI39995123 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID[TIGR00562] protoporphyrinogen oxidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG CGATTGTGGC CGGCGGCGGC ATCTCAGGGC TCGCCACCGC GTACCTGCTG 
AAAACCCGGG CCGCGGAGGA AGGACTTGAG CTCGACGTGA CCCTGGTGGA GCGGGAGGAA
CGCCTGGGGG GCAAAATCTG GAGCATCAAG GAGGAGGGGT ATCTCTGCGA GTGGGGCCCC
AACGGTTTTC TGGACTCCAA ACCCCAGACC CTCGACCTCT GCCGGGAACT GGGCGCGTCT
GACCTGCTCC TGCGGAGCAA CGACAACGCC CGCAAGCGGT TCATCTACAC CGGCGGGGCG
CTGAACCGCC TACCCGAGAA CGGACCCATG TTTCTCAAAA GCGGTCTCAT CTCCTGGCCG
GGCAAGCTGC GGCTCGCCAT GGAACCGTTC ATTCCGAAAA AAGCGGGCGA CGAGGACGAA
ACCCTGGCGG CCTTCGGCCG GCGCCGCCTG GGGGACGAAG CGCTGCGCAA GCTGATTGCG
CCCATGGTGT CGGGGATTTT TGCCGGCAAT CCGGAAACCA TGTCCCTGCG GTCGTGCTTT
CCCCGCATCG CCGAGTTGGA GGATGAATAC GGCAGCTTGG TGCGGGCCAT GATCCGTCTG
GCGAAAAAGA AGAAGCAGGA GGTCGCTCAA GGAAAGGCGG TGGCCAGCGC CGCCGGACCG
GGCGGGGTGC TCACCTCGTT CCGGGACGGC ATCCAGGCCC TCACCGATAT CCTGGCCGAG
CGTCTCGGTC CGGACACTAT CGTATCGGGC CAGGAAGTGC TGGAAGTTTC GCGGGGCGGA
AGCCTCCCCT GGCGGGTGCG GACCGGAAGC ATCGATATGG ACGCCGATCT GGTGATCCTG
GCGACCCCCG CCTATGCCAC CGCCTCCATC ATTCAGGGAG TGGACTCCGA CATGGCCGGC
ATTCTCCGGC AGATCCCCTA CGCCACCATG ACCGTTGTCT GCTTCGGATA TGACCGGGAG
CGGATCGCCC ACGATCTGAA CGGCTTCGGC TATCTCATTC CAAAGGAGGA GGGGATGAAT
ACCCTGGGCA CGCTCTGGGA TTCGAGCATC TTCGAGAACC GGGCGCCGGA AGGTCAGGTC
CTCCTGCGCA GCATGATGGG GGGGGCCTGC TTCCCCGAAT ACGTCAACCT GACCGACGAG
GAGGTCACTG GGCGGGTGAA GAACGACCTC GCCACCATCA TGGGCATCAC GGCGCCTCCT
TCGTTCGTCC GCATCTTCCG CCATCACCAG GCCATCCCCC AGTACACCGT GGGGCACTCC
ACACGCGTAG CCGCTCTGGA GCAGAGAGCC GCCTCCCTGC CGGGACTTTT CCTCACCGGC
AACTCTTACC GGGGTATCGG CCTCAACGAC TGCGTGGCCG CCGCCAACCG CACCGCCGGC
GAGGCCATCG CCCAGCTCAC ATCCCGCTGA
 
Protein sequence
MKKAIVAGGG ISGLATAYLL KTRAAEEGLE LDVTLVEREE RLGGKIWSIK EEGYLCEWGP 
NGFLDSKPQT LDLCRELGAS DLLLRSNDNA RKRFIYTGGA LNRLPENGPM FLKSGLISWP
GKLRLAMEPF IPKKAGDEDE TLAAFGRRRL GDEALRKLIA PMVSGIFAGN PETMSLRSCF
PRIAELEDEY GSLVRAMIRL AKKKKQEVAQ GKAVASAAGP GGVLTSFRDG IQALTDILAE
RLGPDTIVSG QEVLEVSRGG SLPWRVRTGS IDMDADLVIL ATPAYATASI IQGVDSDMAG
ILRQIPYATM TVVCFGYDRE RIAHDLNGFG YLIPKEEGMN TLGTLWDSSI FENRAPEGQV
LLRSMMGGAC FPEYVNLTDE EVTGRVKNDL ATIMGITAPP SFVRIFRHHQ AIPQYTVGHS
TRVAALEQRA ASLPGLFLTG NSYRGIGLND CVAAANRTAG EAIAQLTSR