Gene Daro_1048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1048 
Symbol 
ID3568211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1149253 
End bp1150668 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content56% 
IMG OID637679510 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_284274 
Protein GI71906687 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00538] oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTGCC TAATGAACTT CGCAACAGAA AACCTCGTCT TCGATCCGCA GATCATTCGC 
CGCTTCGACG TCAACGGCCC GCGCTACACG TCCTATCCAA CGGCTGATCG CTTTGTTGAG
GCTTTCGACT CGGAGGCTGC CAAGCTGTGG CTCGGAAAGC GCAATATTGG CGGGATCAGC
CGACCGCTCT CATTATACTT CCACATCCCT TTCTGTAACA CTATTTGCTA TTACTGCGCC
TGTAACAAGA TCATCACCAA GGATCATGGG CGCAGCGCCA AATACCTGAA ATATCTGGCC
AAGGAACTCG AGATTCAGGC GGCGGCACTG GAAGGCCGCG ACGGTGAGCA CGAGGTCATC
CAGTTGCATT GGGGTGGCGG TACGCCGACC TTCCTGTCGC ACAGCGAAAT GCGCCAGTTG
ATGGGCGAAA CCCGCAAGCA CTTCAAGTTG CTCGATGGCG GCGAATATTC GATTGAAGTC
GACCCCCGCA AGGTGGATAC GGCCACGGTC GCTCTGCTGG GTGAGCTGGG TTTCAACCGC
ATGAGCGTCG GCGTTCAGGA TTTCGACGAA AAGGTACAAG TTGCCGTCAA TCGCGTTCAG
AGCGAGGAAG AAACCTACAG CGTCATCCGT GATGCGCGGG CCAACGGCTT CAAGTCAGTT
TCTGTCGACC TGATCTACGG TCTGCCGCAT CAGACGGTGA TGGGGTTCAA CCGGACGCTG
GAGCGCGTTC TGGCGATGGA TCCTGACCGT CTGTCGATCT ACAACTATGC GCACATGCCC
AGCATGTTCA AGCCGCAGCG CCGGATCAAC GAAGGTGATC TGCCCTCAGC CGATACCAAG
CTGCAGATTC TGGCGCTGGC GATCAAGAAA CTGACCGATG CGGGTTATGT CTTCATCGGC
ATGGACCACT TTGCCAAGCC GGATGACGAA CTGGCAGTTG CCCAGCGTCA GGGCCGCCTG
CACCGTAATT TCCAGGGCTA TTCGACTTAC GCCGATTGCG ACATGCTGTC TTTCGGCATC
TCTTCGATCA GCAAGGTCGG GCCGACCTAT TACCAGAACG TCAAGACGGC GGACGAGTAC
TACGATCGTC TGGATACCGA TACGCTGCCG GTTTTCCGCG GTATCGAGCT GACGGCTGAC
GATATCCTGC GTCGTTCGAT CATCCAGGCG TTGATGTGCC ATTTCGAGTT GTCCATCGAG
AGCATCGAAA GCGCCCATCT GATCGACTTC CACAAGTATT TCGCAGCCGA ACTGGAAGAC
ATGAAGGAAA TGGAGCGGGC CGGTTTGCTC AAGATCGATC GCGAGTGGAT CACCGTACTG
CCACCAGGAC GCCTGCTGGT TCGCATCATT TCCATGGTTT TTGATCGCTA TCTGCGGGCA
GGGCGCCAGC GGGCAACCTA CTCCAAAGTC ATCTGA
 
Protein sequence
MACLMNFATE NLVFDPQIIR RFDVNGPRYT SYPTADRFVE AFDSEAAKLW LGKRNIGGIS 
RPLSLYFHIP FCNTICYYCA CNKIITKDHG RSAKYLKYLA KELEIQAAAL EGRDGEHEVI
QLHWGGGTPT FLSHSEMRQL MGETRKHFKL LDGGEYSIEV DPRKVDTATV ALLGELGFNR
MSVGVQDFDE KVQVAVNRVQ SEEETYSVIR DARANGFKSV SVDLIYGLPH QTVMGFNRTL
ERVLAMDPDR LSIYNYAHMP SMFKPQRRIN EGDLPSADTK LQILALAIKK LTDAGYVFIG
MDHFAKPDDE LAVAQRQGRL HRNFQGYSTY ADCDMLSFGI SSISKVGPTY YQNVKTADEY
YDRLDTDTLP VFRGIELTAD DILRRSIIQA LMCHFELSIE SIESAHLIDF HKYFAAELED
MKEMERAGLL KIDREWITVL PPGRLLVRII SMVFDRYLRA GRQRATYSKV I