Gene GM21_3577 details

Gene Information

Locus tag: GM21_3577
Symbol:
ID: 8138950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4159026
End bp: 4160171
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 66%
IMG OID: 644871197
Product: oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_003023356
Protein GI: 253702167
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
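
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4159026
end_bp = 4160171
gene_length_bp = 1146
protein_length_aa = 381

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and contributes no amino acid.
expected_protein_length = codons - 1

print(span == gene_length_bp)
print(remainder == 0)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks print True, so the listed gene length, protein length, and coordinates are mutually consistent.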


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 142
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCCG GACTGTACCT GCATTTCCCC TTTTGCCTGA AAAAGTGCCT CTACTGCGAT 
TTCAACTCGA CTCCCGCTGG AGGCTCCGAC CACCGCGGCT ACGTGGAGCT GCTCCTGAAG
GAGATGGAGC TCAGGCAAGC CGCACTTCCC GAGCCGGTCA TCGCGCCGAC CCTTTACCTC
GGGGGGGGGA CCCCCTCCCT GATGGCGCCG GAACTGGTGG GGCTGATCGT CGACGCGGCG
CGAAAGAGCT TCTCCCTGGA GCCGGACGCG GAGGTCACCC TGGAGGCGAA CCCGGGGACG
CTCACGCCCG AGCGCCTGCA GGGGTACCGG TCAGCCGGGG TGAACCGCCT GTCGCTGGGG
ATACAGTCCT TCGAGGACCG CCTGCTTAAG CGGCTGGGGC GCGTGCACAG CTGCGCCGAG
GCCCTCTCCG CCTACCAGGA TGCGCGCCGC GCCGGTTTCG ACAACATCAG CATCGACCTG
ATGCACTCCC TGCCGGGGCA ATCCCTTTCT CAGTGGCGCG AGGCGCTGGA ACTTGCCGTC
TCGCTAGCTC CCGAGCACGT CTCCGCCTAC GCACTCTCCA TAGAGGAGGG TACGCCCTTC
GAGCGGCTCC ACGACGTAGG GGAGCTTCCC CTTCCCGGCG AAGAGGAGGC GGCCGCCATG
TTCGAGGCGA CCGCGGCCGT TCTCTCCAAG GCGGGATACC GCCATTACGA GATCTCCAAC
TACGCGAAAC CCGGACGCCA TTCCCGCCAC AACTCCGCCT ACTGGAGCAG GCTTAGCTAC
CTTGGCTTTG GCGCGGGGGC GCATTCGTTT TGGAACCCCG ACGGGCTCGG GCGCCGCTGG
AAGAATCCAG GCGAAGCGGC GGCGTATGCC GCGGGAATCC GGGTGGGCAT CCCCGCGGAC
GAGGAGCCGG AGCAGCTAAC CCTGGAGGAC GCGCTCTCCG AGTGCTTCTT TCTCGGCCTG
AGGGTGCTCG ACGGGCTCGA CCTGGCTCCG CTTTTCGAGC GCTACGGGGA GAGCGCGCTG
GCGCCGCACC TTAAGCAGGT GGCGGCGCTG GAAAAAAAGG GCGCTCTCAC CCGGGAGGGA
AGCCGCATCC GCATCTCCCC CGACGCCGTC ATACTCGCCA ACGGCATCTT CTCCAGCTTC
GTCTAA
 
Protein sequence
MRAGLYLHFP FCLKKCLYCD FNSTPAGGSD HRGYVELLLK EMELRQAALP EPVIAPTLYL 
GGGTPSLMAP ELVGLIVDAA RKSFSLEPDA EVTLEANPGT LTPERLQGYR SAGVNRLSLG
IQSFEDRLLK RLGRVHSCAE ALSAYQDARR AGFDNISIDL MHSLPGQSLS QWREALELAV
SLAPEHVSAY ALSIEEGTPF ERLHDVGELP LPGEEEAAAM FEATAAVLSK AGYRHYEISN
YAKPGRHSRH NSAYWSRLSY LGFGAGAHSF WNPDGLGRRW KNPGEAAAYA AGIRVGIPAD
EEPEQLTLED ALSECFFLGL RVLDGLDLAP LFERYGESAL APHLKQVAAL EKKGALTREG
SRIRISPDAV ILANGIFSSF V
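
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, showing one way to strip the block formatting, compute a GC fraction, and translate DNA to protein. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for all codons (the tables differ in permitted start codons, which this sketch does not model). Only the first line of the gene sequence and the first twenty residues of the protein are used as sample input.

```python
from itertools import product

# Standard codon assignments in NCBI TCAG order; table 11 uses the same
# amino acid assignments (alternative start codons are not handled here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the spaces and newlines used for ten-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Sample input: first line of the gene sequence and the matching protein prefix.
dna_sample = clean(
    "ATGCGTGCCG GACTGTACCT GCATTTCCCC TTTTGCCTGA AAAAGTGCCT CTACTGCGAT"
)
protein_sample = clean("MRAGLYLHFP FCLKKCLYCD")

print(len(dna_sample))
print(translate(dna_sample) == protein_sample)
print(round(gc_fraction(dna_sample), 3))
```

Pasting the full gene sequence into `dna_sample` instead would allow the same functions to be compared against the listed GC content and the full protein sequence; the translated string would then end with `*` for the stop codon.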