Gene Sbal223_1801 details


Gene Information

Locus tag: Sbal223_1801
Symbol: hemH
ID: 7088335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 2121891
End bp: 2122910
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 50%
IMG OID: 643460705
Product: ferrochelatase
Protein accession: YP_002357729
Protein GI: 217972978
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0276] Protoheme ferro-lyase (ferrochelatase)
TIGRFAM ID: [TIGR00109] ferrochelatase
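
The start, end, gene length, and protein length fields above are mutually consistent. The following minimal Python sketch checks that arithmetic. It assumes inclusive, 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length; the function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2121891, 2122910)
print(length)                  # 1020
print(protein_length(length))  # 339
```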


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.452842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000219286
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCCTAA CGTTGAATCC TTCCCCTGCT TTTGGTGTGT TATTGGTTAA CCTCGGCACG 
CCTGATGAAC CTAGCCCTAA AGCCGTAAAA CGATTTCTAA GGCAGTTTTT AAGTGATCCC
CGTGTCGTCG ATTTATCGCC GTGGATTTGG CAACCGCTGC TGAACGGCAT TATCCTCAAC
ACCCGTCCGG CGAAAGTCGC CAAACTCTAC CAAAGCGTGT GGACGCCGGA AGGCTCGCCC
TTGATGGTGA TAAGTGCACG CCAAGCACAA AAGCTGGCGG TGGACTTAAG CGCCACGTTT
AACCAGACCA TACCGGTTGA ACTAGGCATG AGTTACGGCA ATCCCTCCAT CGATGCGGGA
TTTGAACGCT TGAAAGACCA AGGGGCAGAG CGCATTATTG TGCTGCCTTT GTATCCCCAA
TATTCCTGCT CGACGGTGGC AAGTGTGTTC GATGCCGTTG CCAGTTATCT TAAAACCGTG
CGTGACATAC CTCAGGTTCG CTTTAATAAA GACTATTTTG ATCATGATGC TTATATAGCG
GCGCTGGCGC ATTCAGTGAC ACGTCACTGG AAAACCCATG GGCAGGCCGA TAAGTTGCTC
TTGTCATTTC ACGGTATTCC GCTGCGTTAT GTGAAAGAGG GCGATCCATA CCGCGAGCAG
TGTTTTGTCA CGGCTAAACT GTTAGCGCAA AAACTTGAGC TTAGCGAGTC GCAATGGCAG
GTGTGCTTCC AATCGCGCTT TGGCCGCGAA GAGTGGCTCA CGCCCTACGC CGATCAATTA
TTGGCCGAGT TACCTGCACA AGGAATCAAA AGTGTCGATG TGATTTGCCC AGCGTTTGCT
ACTGACTGCC TCGAAACCTT AGAAGAAATC TCCATCGGCG CTAAGGAAAC CTTTTTAGAT
GCAGGTGGCA GCGACTATCG GTTTATCCCT TGTTTGAACG ACGATGAGTT ACACATTGAA
TTATTACGGC AATTAATCCA AGAACAAGCG GTATCTTGGG CCCATACCCA AGCCACCTGA
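
The GC content field can in principle be recomputed from the gene sequence as listed. The sketch below is one way to do that, assuming the sequence is pasted as whitespace-separated 10-base blocks like the rows above. The helper names are hypothetical, and only the first row is used here for brevity; the full sequence would be needed to compare against the reported 50%.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGAGCCTAA CGTTGAATCC TTCCCCTGCT TTTGGTGTGT TATTGGTTAA CCTCGGCACG"
print(f"{gc_content(first_row):.1%}")
```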
 
Protein sequence
MSLTLNPSPA FGVLLVNLGT PDEPSPKAVK RFLRQFLSDP RVVDLSPWIW QPLLNGIILN 
TRPAKVAKLY QSVWTPEGSP LMVISARQAQ KLAVDLSATF NQTIPVELGM SYGNPSIDAG
FERLKDQGAE RIIVLPLYPQ YSCSTVASVF DAVASYLKTV RDIPQVRFNK DYFDHDAYIA
ALAHSVTRHW KTHGQADKLL LSFHGIPLRY VKEGDPYREQ CFVTAKLLAQ KLELSESQWQ
VCFQSRFGRE EWLTPYADQL LAELPAQGIK SVDVICPAFA TDCLETLEEI SIGAKETFLD
AGGSDYRFIP CLNDDELHIE LLRQLIQEQA VSWAHTQAT
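
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that step, not the pipeline that produced this record. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code, and it does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence (60 bases, 20 codons).
first_row = "ATGAGCCTAA CGTTGAATCC TTCCCCTGCT TTTGGTGTGT TATTGGTTAA CCTCGGCACG"
print(translate(first_row))  # MSLTLNPSPAFGVLLVNLGT
```

The printed residues match the first two blocks of the protein sequence above (MSLTLNPSPA FGVLLVNLGT).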