Gene SNSL254_A3351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3351 
Symbol 
ID6482342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3252447 
End bp3253583 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content55% 
IMG OID642738642 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_002042363 
Protein GI194443015 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value0.315269 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAC TACCGCCGCT CAGTCTTTAT ATTCATATTC CCTGGTGTGT ACAAAAATGT 
CCATATTGCG ACTTCAATTC CCATGCGTTG AAGGGCGAGG TGCCGCATGA CGACTACGTC
CAGCATCTGT TAAGAGATCT GGACGCCGAT GTTGCCTGGG CGCAAGGGCG TGAAGTAAAG
ACCATTTTTA TTGGCGGCGG TACGCCAAGC CTGCTTTCCG GGCCGGCGAT GCAAACGCTG
CTGGACGGCG TGCGCGCGCG CCTGAATCTG GCGGCGGATG CGGAAATTAC CATGGAAGCG
AACCCCGGCA CGGTCGAAAC CGACCGCTTC ATCGACTATC AGCGCGCCGG CGTAAACCGG
ATCTCCATTG GCGTGCAGAG CTTTAGTGAG CCTAAGCTGA AACGCCTTGG CCGTATTCAC
GGTCCACAAG AGGCGATGCG GGCAGCAAGA CTGGCAAATG GGCTTGGGCT ACGCAGCTTT
AACCTCGACT TGATGCATGG ATTGCCGGAC CAAACGCTGG AAGAGGCGCT GAACGATTTG
CGACAGGCGA TTGCGCTTAA TCCGCCGCAT CTCTCATGGT ATCAATTGAC GATTGAACCC
AACACTTTGT TCGGTTCGCG TCCGCCGGTT TTACCGGACG ATGACGCACT GTGGGATATC
TTTGAGCAGG GCCACCAGTT ATTAACCGCT GCTGGCTATC AGCAATACGA AACGTCGGCC
TATGCCAAAC CCGGTTATCA GTGCCAGCAT AATCTGAACT ACTGGCGCTT TGGCGACTAT
CTGGGGATTG GCTGCGGCGC CCACGGTAAA GTCACTTTCC CGGGCGGCAG GATCCTGCGC
ACCACCAAAA CCCGTCACCC ACGCGGTTAT ATGCAGGGAC GTTACCTGGA AAGCCAGCGT
GACGTGAGTG ATGACGATAA ACCCTTTGAG TTCTTTATGA ATCGTTTTCG GTTGCTGGAG
CGCGCGCCTC GCGCCGAATT TGTCGCCTAT ACCGGGCTTA CGGAAGCGGT TATTCGTCAG
CCAATCGACG AGGCTATTGC CCAGGGCTAC CTGACCGAAT GCGAGCAATA CTGGCAGATT
ACCCGGCACG GTAAACTGTT TTTAAACTCT CTTCTTGAGT TGTTTCTCGC GGAATAA
 
Protein sequence
MAKLPPLSLY IHIPWCVQKC PYCDFNSHAL KGEVPHDDYV QHLLRDLDAD VAWAQGREVK 
TIFIGGGTPS LLSGPAMQTL LDGVRARLNL AADAEITMEA NPGTVETDRF IDYQRAGVNR
ISIGVQSFSE PKLKRLGRIH GPQEAMRAAR LANGLGLRSF NLDLMHGLPD QTLEEALNDL
RQAIALNPPH LSWYQLTIEP NTLFGSRPPV LPDDDALWDI FEQGHQLLTA AGYQQYETSA
YAKPGYQCQH NLNYWRFGDY LGIGCGAHGK VTFPGGRILR TTKTRHPRGY MQGRYLESQR
DVSDDDKPFE FFMNRFRLLE RAPRAEFVAY TGLTEAVIRQ PIDEAIAQGY LTECEQYWQI
TRHGKLFLNS LLELFLAE