Gene Saro_3336 details

Gene Information

Locus tag: Saro_3336
Symbol:
ID: 3915983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3558219
End bp: 3559232
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 67%
IMG OID: 640446121
Product: ferrochelatase
Protein accession: YP_498605
Protein GI: 87201348
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0276] Protoheme ferro-lyase (ferrochelatase)
TIGRFAM ID: [TIGR00109] ferrochelatase
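
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3558219
END_BP = 3559232


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.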


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCGCC CCGCAGACCA CCCCACTGTT CTCACCGGCA AGGTCGGGGT GCTTCTGGTC 
AACCTCGGCA CGCCCGATGC GCCCGATGCG GGCGCGGTGA AGCGCTATCT CAAGGAGTTC
CTGTCCGACC GCCGTGTGGT GGAAATTCCC GCGCTCGTCT GGCAGCCGAT CCTGCGTGGC
ATCATCCTGA ACACGCGCCC CCGGAAATCA GCGCACGCCT ACGCCCAGGT ATGGACGGAT
GAGGGATCGC CGCTGGCCGC GATCACGGCC GCGCAGGCAC GCGCGCTTCA GGCACGGCTG
GGCGAAAGCG CGATCGTGCG GCACGCAATG CGCTACCAAT CGCCCGCCAT GGCGAAAGAG
CTGGACGCGC TGCTGCAGGC CGGGTGCGAG CGCATTCTCG TCGCGCCGCT CTACCCGCAC
TATTCGGGGG CGACGACGGC TTCCGCGCTC GATGCGGTGG CAGACTGGAT CAAGGCGCGT
CGCCGCCTTC CCGCACTGCG CACCCTGCCG CCTTATCACG ACGATCCGGC CTATATCGGC
GCGCTTCACG CCGACCTCTC GCGCCAGATC GACGCTCTCG ACTTCGCGCC CGAACTGCTG
CTGCTGAGTT ATCACGGCAT GCCCGAACGG ACGCTGCACT TGGGCGACCC CTACCACTGC
CACTGCCGCA AGACCTCGCG CCTGCTGGGC GAGCGTTTTG CGCAGAGCAA TCCGGCGCTG
CGGCTGGAGA CCACGTTCCA GTCGCGTTTC GGCAAGGCAA AGTGGCTCGA GCCTGCAACC
GATGCCGTGC TGGTCGACGA AGCTCGCAAG GGCACGCGCC GCATCGCCAT CGCCGCGCCC
GGGTTTTCCG CCGATTGCCT GGAAACGCTC GAGGAACTGG CGATCCGCGG CAAGGAAGAT
TTCGTCGCGG CGGGTGGTAC GCACTTCGCC TCGCTCGCCT GTCTCAATGC GGGCGACGAC
GGCATGGACA TGATCGAGGC GCTGGTCCGG CGCGAGCTTT CGGGCTGGAT CTGA
 
Protein sequence
MQRPADHPTV LTGKVGVLLV NLGTPDAPDA GAVKRYLKEF LSDRRVVEIP ALVWQPILRG 
IILNTRPRKS AHAYAQVWTD EGSPLAAITA AQARALQARL GESAIVRHAM RYQSPAMAKE
LDALLQAGCE RILVAPLYPH YSGATTASAL DAVADWIKAR RRLPALRTLP PYHDDPAYIG
ALHADLSRQI DALDFAPELL LLSYHGMPER TLHLGDPYHC HCRKTSRLLG ERFAQSNPAL
RLETTFQSRF GKAKWLEPAT DAVLVDEARK GTRRIAIAAP GFSADCLETL EELAIRGKED
FVAAGGTHFA SLACLNAGDD GMDMIEALVR RELSGWI
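
The sequences above are printed in space-separated groups of 10, so they need to be joined before any analysis. The following is a rough sketch in Python of that cleanup, with a GC-fraction calculation and a simple CDS sanity check. The helper names are hypothetical. The check assumes an ATG start codon, which is a simplification because translation table 11 also permits alternative start codons. The demo input is only the first two groups of the gene sequence, not the full record.

```python
def clean_sequence(text):
    """Drop spaces and line breaks, and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq, stops=("TAA", "TAG", "TGA")):
    """Cheap check: length divisible by 3, ATG start (simplified), stop at end."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq.endswith(stops)


# Only the first two 10-base groups of the record, as a short demo input.
demo = clean_sequence("ATGCAGCGCC CCGCAGACCA")
print(len(demo), gc_fraction(demo))
```

Pasting the full Gene sequence block into clean_sequence allows its length and GC fraction to be compared with the Gene Length and GC content fields.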