Gene Sare_3249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3249 
SymbolhemH 
ID5705400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3742189 
End bp3743220 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content73% 
IMG OID641272677 
Productferrochelatase 
Protein accessionYP_001538044 
Protein GI159038791 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTACG ACGCGGTGAT GCTGGTTTCC TTCGGCGGGC CCGAGCGGCC CGAGGACGTG 
ATGCCCTTCC TGCAGAATGT GACCCGGGGC CGGGGCGTGC CGCCGGAGCG GTTGGCCGAG
GTCGCCGAGC ACTACCTGCA CTTCGGTGGG GTGTCGCCGA TCAACCAGCA GTGCCGCGAG
CTCCTCGCCG CGATCCGGGA GGACTTCGCT GCCAACGGTG TCGACCTCCC GGTCTACTGG
GGTAACCGGA ACTGGGATCC GATGCTCGCC GACACCGTGG CGCGGATGCA TGACGACGGC
GTCGAGCGGG CCCTGGCGTT CGTGACCAGC GCCCTCGGCG GGTACTCGTC CTGTCGGCAG
TACCAGGAGG ACATCGCTGC GGCCCGGGCG GCGGTCGGCC CGGACGCCCC GGTGGTGGAG
AAGCTGCGCC AGTTCTGGGA CCATCCCGGG TTCGTCGAGC CGCACTCCGA CGCGGTGCGG
GCGGCGCTGG CCCAACTGGA CCCGGCGCGA CGGGACAGCA CCCGGATCGT CTTCACCGCC
CACTCGGTCC CCACCTCCGC GGCGGCAGCC GCCGGCCCGC ACGGTGGCCG GTACGAGGCG
CAGTTGGCCG AGACGGCCCG GCTGGTACAC GCCGCTGCCG CCCCCGACCT GGCCTACGAC
CTGGTGTGGC AGAGCCGTTC CGGACCCCCG CAGGTACCCT GGCTGGAGCC GGACGTCAAC
GACCACCTCG TGGCCTTGCC CGCGCAGGGC GTCACCGGTG TCGTGGTCAG CCCGATCGGG
TTCGTCTCCG ACCACCTGGA GGTGGTGTGG GACCTCGATA CCGAGGCGCG GGCGACCGCC
GGGCAGTTGG GCCTGGACTT CGCCCGGGCC GCCACGCCGG GCACCGATCC ACGGTTTGTG
GCGATGGTGC GCGAGCTGGT CCGTGAGCGT ACCGATCCGG CTGGCGCGAC GCTGCGCCGG
CGCCTCGGCG AGTTGCCGAT GTGGGACACC TGCCCGGCGG TCTGCTGCGT TCCGGCCCGC
CGCCCCTCCT GA
 
Protein sequence
MAYDAVMLVS FGGPERPEDV MPFLQNVTRG RGVPPERLAE VAEHYLHFGG VSPINQQCRE 
LLAAIREDFA ANGVDLPVYW GNRNWDPMLA DTVARMHDDG VERALAFVTS ALGGYSSCRQ
YQEDIAAARA AVGPDAPVVE KLRQFWDHPG FVEPHSDAVR AALAQLDPAR RDSTRIVFTA
HSVPTSAAAA AGPHGGRYEA QLAETARLVH AAAAPDLAYD LVWQSRSGPP QVPWLEPDVN
DHLVALPAQG VTGVVVSPIG FVSDHLEVVW DLDTEARATA GQLGLDFARA ATPGTDPRFV
AMVRELVRER TDPAGATLRR RLGELPMWDT CPAVCCVPAR RPS