Gene P9303_14381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_14381 
SymbolhcaE 
ID4778338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1235325 
End bp1236635 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content37% 
IMG OID640086947 
ProductRieske iron-sulfur protein 2Fe-2S subunit 
Protein accessionYP_001017449 
Protein GI124023142 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACAGTTG AGTCTCCAAC CGAGAAAGCC ATAACTCCTT CAGTTGGCCT GCTGGGTTGG 
TATTCTGTGT GCGCCAGTCA GTTATTAAAG CAAAATGAAT TATATCATCT GTCCATGTAT
AATGAGCCAT TAGTTATATA CAGAGACAAG GAAAATAAAC CGAGATGTAT CAAAGATTCG
TGCCCTCATC GATCGGCATC ATTTCGTGGG GGAGAAAGTA AAAATGGCGA AATCATCTGC
CCATATCATG GAGCTCGCTA TACGACATCC TGCAATCAGG ATGGATTTGA TAGAATAACT
TGCAACCATA TTGTTGATTC TGATTATGAT AATTTTGCAA AATATTTACA CCTGAGGCAA
TATCCATGTG TAGAACAAGG TGATTATATA TATATTTATT ATACAGGGGA AGCAAAGACA
AGTCCAAATG ATTTCAAGAT AAACTCTGAG CTAGAACCAA GTCTGCCAGA GACGTATGGA
TTTGACTTAG CAGATTCAAA ATTTGAAGAA GTATTTATAG ACTTTAAATG CGATTGGTCT
CGCATCATAG AAAACCATTT AGACATCCTA CATATATTTT GGCTGCATGG TAATACTCTG
CCTGGAAATG ATGTCAACAG AGAAACAATT AAAAGCTTCA ATCAAACAAT CAATAAGGAT
CAATATCATC TTCGAAGTGT CTACAATGAG AAAGGAAATA AAAAGGAGGA GTTTATTTCT
CAAATCTTCA TCCCTCCTGG CCGTGTGATT ATATTCAAGG GCTCGCCTGA GCAGGCAAGA
TATGTACAAG TTTTAGATCA TATTCCTTTG GCTCACAATC GAGCACGAAT CATTGTTCGT
CATTATAGAA AGTTTCTTAA GAATAAGTTT CTGTGCAAAT TACTTCTCTT TAAGCAAAGG
CAGCAACAAG TATTTTACAA AATTTTCTCG GAAGATTATC TCGTCTTACA AACGCAAACC
TTTAATGAAC AGATGGGCTA CATGCACCAA GGGCAAAACA AACTTTTAGC AGAAGACAAG
ATGATTAAAC ATTTTTGGGA TTGGCATCAA CAATCCATTG AGAAAGAAAG CCCATGGACT
ATACACCCTA CATCGGCACA TACAAATACA ATTCATCAAG ATATGTTGAT GGTATACCCT
CCCGCAAACC CACAGTTATC TCATGATGTT CAACGTATAA TCGATCGTAA AGTGGCCGTT
CGTCTATTCT CTATAGTTAT AATCATACTA GCTTTCATTT TTGCGCCAAA CTTAGTTCAA
CAAATCAAGT CTGGAAATGA TTCAATACCT ATGGTTGAAA CTCAAGAATA A
 
Protein sequence
MTVESPTEKA ITPSVGLLGW YSVCASQLLK QNELYHLSMY NEPLVIYRDK ENKPRCIKDS 
CPHRSASFRG GESKNGEIIC PYHGARYTTS CNQDGFDRIT CNHIVDSDYD NFAKYLHLRQ
YPCVEQGDYI YIYYTGEAKT SPNDFKINSE LEPSLPETYG FDLADSKFEE VFIDFKCDWS
RIIENHLDIL HIFWLHGNTL PGNDVNRETI KSFNQTINKD QYHLRSVYNE KGNKKEEFIS
QIFIPPGRVI IFKGSPEQAR YVQVLDHIPL AHNRARIIVR HYRKFLKNKF LCKLLLFKQR
QQQVFYKIFS EDYLVLQTQT FNEQMGYMHQ GQNKLLAEDK MIKHFWDWHQ QSIEKESPWT
IHPTSAHTNT IHQDMLMVYP PANPQLSHDV QRIIDRKVAV RLFSIVIIIL AFIFAPNLVQ
QIKSGNDSIP MVETQE