Gene PCC8801_0320 details


Gene Information

Locus tag: PCC8801_0320
Symbol: hemH
ID: 7104044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 315955
End bp: 317118
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 42%
IMG OID: 643473429
Product: ferrochelatase
Protein accession: YP_002370575
Protein GI: 218245204
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0276] Protoheme ferro-lyase (ferrochelatase)
TIGRFAM ID: [TIGR00109] ferrochelatase
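
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 315955
END_BP = 317118

def cds_lengths(start, end):
    """Return (nucleotide length, protein length) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute an amino acid.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_len, nt_len // 3 - 1

gene_len, protein_len = cds_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 1164 387
```

Under those assumptions the result agrees with the Gene Length (1164 bp) and Protein Length (387 aa) fields.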


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCGTG TTGGGGTCTT ATTACTAAAT TTAGGGGGAC CGGAACAATT AGAAGATGTT 
CGCCCCTTTT TATTTAATCT GTTTTCTGAT CCAGAAATTA TTCGGCTGCC CTTTCCTTGG
CTACAAAAGC CGTTAGCTTG GTTAATTTCT AGCTTAAGAA GCGAAAAATC TCAGGAAAAC
TACAAACAAA TTGGGGGAGG TTCTCCCTTA AGAAAGATTA CCGAAGCACA AGCCGAAGCC
CTAGAACAGA GATTAGCAGA AATTGGCCAC ACGGCACAGA TTTATATTGG GATGCGCTAT
TGGCATCCCT TTACCGAAGA AGCGATCGCA CGGATTAAAC GCGATCGCCT CAAAAATTTG
GTCATTCTTC CTCTCTATCC CCAATTTTCC ATCAGTACCA GTGGCTCTAG CTTCCGTGTT
CTCGAAGAAA TGTGGAATGC TGATCCTCAA CTCAAAGCTA TCAATTACAC CTTAATTCCT
TCTTGGTACG ATGATCCTAG GTATTTAGCA GCCATGGCTG ATTTAATCGC CCAAGAACTC
GATAAATGCG AAGAACCCAA CAGAGTCCAT ATTTTCTTTA GTGCCCACGG GGTTCCCCAA
AGTTATGTGG ATGAAGCGGG AGATCCCTAT CAAGCCGAAA TTGAAGCTTG TACCCGCTTA
ATCATGCAAA CCCTTAACCG TCCCAATGAT TACACCCTAG CTTATCAAAG TAGGGTTGGT
CCCGTGGAAT GGCTTAAACC CTACACCGAG GACGCACTCA AGGAATTGGG AGAACAGGGA
GTTCAAGATT TACTGGTAGT TCCTATTAGT TTTGTCTCTG AACATATCGA AACCTTACAA
GAAATTGATA TTGAATATCG AGAAGTGGCT GAAGAAGCAG GAATTGAAAA TTTTTATCGC
GTTCCTGCCT TAAATACCCA TCCCGTTTTT ATTGATTCTT TGGCGCAATT AGTGACAAAA
TCCCTTCAAG AACCCCCTTG TACTTTTAAT CAGGTGATAC ACCCTAAAGA AAACATGAAA
ATGTATCCTC AAGAGCGTTG GCAATGGGGT CTGACAACCG CAGCAGAAGT CTGGAATGGA
CGATTAGCAA TGGTGGGTTT TATTGCATTA TTAATTGAGT TAATTAGTGG CCATGGCCCC
TTACATTTTG TCGGATTACT TTAA
 
Protein sequence
MDRVGVLLLN LGGPEQLEDV RPFLFNLFSD PEIIRLPFPW LQKPLAWLIS SLRSEKSQEN 
YKQIGGGSPL RKITEAQAEA LEQRLAEIGH TAQIYIGMRY WHPFTEEAIA RIKRDRLKNL
VILPLYPQFS ISTSGSSFRV LEEMWNADPQ LKAINYTLIP SWYDDPRYLA AMADLIAQEL
DKCEEPNRVH IFFSAHGVPQ SYVDEAGDPY QAEIEACTRL IMQTLNRPND YTLAYQSRVG
PVEWLKPYTE DALKELGEQG VQDLLVVPIS FVSEHIETLQ EIDIEYREVA EEAGIENFYR
VPALNTHPVF IDSLAQLVTK SLQEPPCTFN QVIHPKENMK MYPQERWQWG LTTAAEVWNG
RLAMVGFIAL LIELISGHGP LHFVGLL
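
As a spot check that the gene and protein sequences correspond, the first and last blocks of the gene sequence can be translated by hand or with a short script. The sketch below is a minimal example, not the database's own pipeline: it builds the codon table from the compact NCBI-style amino acid string (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI ordering: bases T, C, A, G with the third position
# varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1, ignoring whitespace.

    Trailing bases that do not form a complete codon are dropped.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 and last 24 bases of the gene sequence, copied from the record.
first_30 = "ATGGATCGTG TTGGGGTCTT ATTACTAAAT"
last_24 = "TTACATTTTG TCGGATTACT TTAA"

print(translate(first_30))  # MDRVGVLLLN
print(translate(last_24))   # LHFVGLL*
```

The two outputs match the start (MDRVGVLLLN) and end (LHFVGLL) of the protein sequence, with the final TAA codon read as the stop.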