Gene SeD_A3447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3447 
Symbol 
ID6872939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3310174 
End bp3311310 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content54% 
IMG OID642786441 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_002217079 
Protein GI198243140 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value0.286445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAC TACCGCCGCT CAGTCTTTAT ATTCATATTC CCTGGTGTGT ACAAAAATGT 
CCATATTGCG ACTTCAATTC CCATGCGTTG AAGGGCGAGG TGCCACATGA CGACTACGTC
CAGCATCTGT TAAACGATCT GGATGCCGAC GTCGCCTGGG CGCAAGGGCG TGAAGTAAAG
ACCATTTTTA TTGGCGGCGG TACGCCAAGC CTGCTTTCCG GGCCGGCGAT GCAAACGCTG
CTGGACGGCG TGCGTGCGCG CCTGAATCTG GCGGCGGATG CGGAAATTAC TATGGAAGCG
AACCCCGGCA CGGTCGAAGC CGACCGCTTC ATCGACTATC AGCGCGCCGG CGTAAACCGG
ATCTCCATTG GCGTGCAGAG CTTTAGTGAG CCTAAGCTGA AACGTCTTGG CCGTATTCAC
GGTCCACAAG AGGCGATACG GGCAGCAAGA CTGGCAAATG GACTTGGGCT ACGCAGCTTT
AACCTCGACT TGATGCATGG ATTGCCGGAT CAAACGCTGG AAGAGGCGCT GAATGATTTG
CGACAGGCGA TTGCGCTTAA TCCGCCGCAT CTCTCATGGT ATCAATTGAC GATTGAACCC
AACACTTTGT TCGGTTCGCG TCCGCCGGTT TTACCGGACG ATGACGCTCT GTGGGATATC
TTTGAGCAGG GCCACCAGTT ATTAACCGCC GCGGGCTATC AGCAATACGA AACGTCGGCC
TATGCCAAAC CCGGTTATCA GTGCCAGCAT AATCTGAACT ACTGGCGCTT TGGCGACTAT
CTGGGTATTG GCTGCGGCGC CCACGGTAAA GTCACCTTCC CGGGCGGCAG GATCCTGCGC
ACCACCAAAA CCCGTCACCC ACGCGGTTAT ATGCAGGGAC GTTACCAGGA AAGCCAGCGT
GACGTGAGTG ATGATGACAA ACCCTTTGAG TTCTTTATGA ACCGTTTTCG GTTGCTGAAG
CGCGCGCCTC GCGCCGAATT TGTCGATTAT ACCGGGCTTA CGGAAGCGGT TATTCGTCAG
CCAATCGACG AGGCTATTGC CCAGGGCTAC CTGACCGAAT GCGAGCAATA CTGGCAGATT
ACCCGGCACG GTAAACTGTT TTTAAACTCT CTTCTTGAGT TGTTTCTCGC GGAATAA
 
Protein sequence
MAKLPPLSLY IHIPWCVQKC PYCDFNSHAL KGEVPHDDYV QHLLNDLDAD VAWAQGREVK 
TIFIGGGTPS LLSGPAMQTL LDGVRARLNL AADAEITMEA NPGTVEADRF IDYQRAGVNR
ISIGVQSFSE PKLKRLGRIH GPQEAIRAAR LANGLGLRSF NLDLMHGLPD QTLEEALNDL
RQAIALNPPH LSWYQLTIEP NTLFGSRPPV LPDDDALWDI FEQGHQLLTA AGYQQYETSA
YAKPGYQCQH NLNYWRFGDY LGIGCGAHGK VTFPGGRILR TTKTRHPRGY MQGRYQESQR
DVSDDDKPFE FFMNRFRLLK RAPRAEFVDY TGLTEAVIRQ PIDEAIAQGY LTECEQYWQI
TRHGKLFLNS LLELFLAE