Gene CPR_1901 details

Gene Information

Locus tag: CPR_1901
Symbol:
ID: 4204740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2099452
End bp: 2100894
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 29%
IMG OID: 642566451
Product: coproporphyrinogen III oxidase
Protein accession: YP_699211
Protein GI: 110802256
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID:
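
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2099452
END_BP = 2100894
GENE_LENGTH_BP = 1443
PROTEIN_LENGTH_AA = 480


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1443
print(coding_codons(GENE_LENGTH_BP))   # 480
```

Under those assumptions, 2100894 - 2099452 + 1 = 1443 bp, and 1443 / 3 - 1 = 480 aa, matching the listed values.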


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATATTA AAATATATCT TAATGACTTA AAGTATAGAT ATGACGTTTA TCAAATGTTT 
AATATATTCT ATACTTTTAA GGAATTAAAG TTTGTAAATG AAGATGAAGA GAGAGACTAC
GATGTCTTTA TATCTGAAAA TATGGTTAAA ATTTCAGAGG GAGACAATAG TTTTTCTTAT
GAATTCAAAG AAGGATACGG ATTTAAAACT GAACTTAAAA AGGGAATATT CAAATTTTTA
TCTGAAACTC TTAAGGATGA ATATCCTTGG GGAACATTAG TTGGAATAAG ACCAAGTAAA
ATAGCATTAT CTCTAATAAG AGAAGGAAAA TCTGAGGAAG AGATAATAAA ATATTTTGAA
GATAATTATA TGGCTAGGGA AGAAAAAGCT AAGCTTTGCA TAGAAGTTGC AGAAAGAGAA
GAAAGTTTTG TAAACAAAGA GGAAAAAAAC ATAAGTATAT ATGTTGGTAT GCCTTTTTGT
CCTACAAGAT GCCTTTATTG TTCCTTTGCA GCAAATCCTA TAGCTGGATG CAAGAAAGAT
GTTGAGCCTT ATTTAGAAGC TTTAAGCAAA GAAATTTCAG CTATAAGTGA TTATGTATCA
AAGAAAGGCT TAAAGATAGA AACTGTTTAT TTTGGTGGAG GCACTCCAAC CTCAGTAAAT
AATGAACAGT TTGAAGTATT AATGAAACAT ATATATGATA GTTTCGTTAA TAATAAAGGA
ATAAAGGAGT TCACTGTTGA ATGTGGAAGA CCTGATTCTA TAACTGAAGA AAAATTAAAA
ACTATGAAGA GATATGAAGT ATCTAGAATA TCTATAAATC CTCAAAGTAT GAACGATAAA
ACCTTGAAAT CAATAGGTAG AGGGCATTTA ACAGAGGATG TAGTGGATAA ATTCAATTTG
GCAAGAAGCT TAGACTTTGA TAACATAAAT ATGGATATTA TAATAGGTCT TCCAAATGAA
GATATTTCAG AGGTTTCTAA AACATGCTCT ATGATAAAGG AGCTTAATCC AGATAGTTTA
ACTATTCATG GTATGTCTAT TAAAAGGGCA TCAAGACTTC ATGAAAATTT AGTTTTACAT
AATACTATAA CTATTGCAGA GCAAAAAAAT CTTAATAAGA TGTATGAGAT GAGTAAAGTT
TTAGGTAGAG AACTAAATAT GCATCCATAT TATATGTATA GACAAAAAAA TATGGTTGGT
AATATGGAGA ATGTAGGATA TTCAAAAGAT AACAAGGAAT GTATCTACAA TATTCAAATG
ATTGAAGATA AGCAAACTAT AATTGCACTA GGAGCAGATG CCGTTTCTAA GGTAGTGTTT
TTAGAAGAAG ATAAAAATCG TATAGAAAGA TTTGCAAATG TTAAAGATGT AAAGGAATAT
GTAAAAAGAA TAGAGGAAAT GGTTGAAGGT AAGATAGAAT TACTTGATAC TTTATATAAA
TAA
 
Protein sequence
MYIKIYLNDL KYRYDVYQMF NIFYTFKELK FVNEDEERDY DVFISENMVK ISEGDNSFSY 
EFKEGYGFKT ELKKGIFKFL SETLKDEYPW GTLVGIRPSK IALSLIREGK SEEEIIKYFE
DNYMAREEKA KLCIEVAERE ESFVNKEEKN ISIYVGMPFC PTRCLYCSFA ANPIAGCKKD
VEPYLEALSK EISAISDYVS KKGLKIETVY FGGGTPTSVN NEQFEVLMKH IYDSFVNNKG
IKEFTVECGR PDSITEEKLK TMKRYEVSRI SINPQSMNDK TLKSIGRGHL TEDVVDKFNL
ARSLDFDNIN MDIIIGLPNE DISEVSKTCS MIKELNPDSL TIHGMSIKRA SRLHENLVLH
NTITIAEQKN LNKMYEMSKV LGRELNMHPY YMYRQKNMVG NMENVGYSKD NKECIYNIQM
IEDKQTIIAL GADAVSKVVF LEEDKNRIER FANVKDVKEY VKRIEEMVEG KIELLDTLYK
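
The gene and protein sequences above are printed in 10-character blocks, so the whitespace has to be removed before they can be compared. As a rough sketch, the snippet below strips that spacing and translates DNA codon by codon. It assumes the standard codon assignments, which translation table 11 shares for internal codons. It does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene starts with ATG. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTATATTA AAATATATCT TAATGACTTA"
print(translate(first_blocks))  # MYIKIYLNDL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein listing would extend the same check to the whole record.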