Gene Syncc9902_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_1643 
SymbolhemH 
ID3743921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1590928 
End bp1592103 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content55% 
IMG OID637771835 
Productferrochelatase 
Protein accessionYP_377645 
Protein GI162138553 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCGCG TCGGCGTCAT CCTGCTGAAC CTGGGCGGAC CAGAAAGAAT CCAGGACGTC 
GGCCCGTTCC TGTACAACCT TTTTGCCGAT CCCGAAATCA TTCGATTGCC CAGTCCAGCG
CTGCAAAAGC CCCTGGCCTG GCTGATCAGC ACCCTGAGAA GTGGCAAATC ACAGGAGGCC
TACCGCTCGA TCGGAGGCGG TTCACCTCTG CGACGGATCA CAGAGCAACA GGCCCGTGAG
CTGCAGAGTT TGTTGCGGCA ACGGGGGATT GATGCCACCA GCTATGTGGC GATGCGCTAT
TGGCATCCCT TCACAGAATC GGCTGTGGCC GACATCAAGG CCGACGGTAT GGATCAAGTG
GTGGTGTTAC CGCTGTATCC CCATTTTTCG ATCAGTACGA GTGGATCCAG TTTCCGGGAG
TTGCAGCGAT TAAGGCAGGG AGACAACGCG TTTGAACAGC TCCCAATTCG ATGCATCCGC
AGCTGGTACG ACCACCCCGG ATATCTGCGA TCCATGGCTG AGTTGATCGC CACTGAGATT
CACAACAGTG ATGTTCCAGA AGCTGCGCAC GTGTTTTTCA GTGCCCATGG TGTTCCCAAA
AGTTATGTCG AAGAGGCCGG CGATCCCTAT CAACAGGAGA TCGAGAAATG CACCGCTCTA
ATCATGGAGA AACTGGCTGA ACTTGTGGGG CATAGCAATC CCCATACCCT CGCCTATCAG
AGCCGTGTGG GTCCGGTGGA GTGGCTTCAG CCCTACACCG AAGAAGCCTT AGAAGAGTTG
GGCCATGCGA AAACACAGGA TCTCGTTGTT GTACCGATCA GTTTCGTAAG CGAGCACATC
GAAACACTGG AAGAGATTGA TATCGAATAT CGAGAGTTGG CAACGGAAGC AGGGGTTGTG
AATTTTCGTC GGGTTCGCGC CCTCGACACC TACAAACCTT TCATCGAGGG TCTGGCCGAT
CTGGTCACCA CAAGTCTGGA AGGCCCCGAG GTGAGCCTTG ATGCCGCAGC CGAGTTGCCA
ACCAAAGTGA AGCTGTATCC TCAAGAAAAG TGGGAATGGG GCTGGAATAA CAGCTCGGAA
GTCTGGAATG GCCGTATCGC CATGGTGGGT TTTTCGGCGT TTCTGCTCGA ACTCCTGAGT
GGTCATGGTC CCTTGCATGC CCTAGGCCTG CTTTAG
 
Protein sequence
MSRVGVILLN LGGPERIQDV GPFLYNLFAD PEIIRLPSPA LQKPLAWLIS TLRSGKSQEA 
YRSIGGGSPL RRITEQQARE LQSLLRQRGI DATSYVAMRY WHPFTESAVA DIKADGMDQV
VVLPLYPHFS ISTSGSSFRE LQRLRQGDNA FEQLPIRCIR SWYDHPGYLR SMAELIATEI
HNSDVPEAAH VFFSAHGVPK SYVEEAGDPY QQEIEKCTAL IMEKLAELVG HSNPHTLAYQ
SRVGPVEWLQ PYTEEALEEL GHAKTQDLVV VPISFVSEHI ETLEEIDIEY RELATEAGVV
NFRRVRALDT YKPFIEGLAD LVTTSLEGPE VSLDAAAELP TKVKLYPQEK WEWGWNNSSE
VWNGRIAMVG FSAFLLELLS GHGPLHALGL L