Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_14381 |
Symbol | hcaE |
ID | 4778338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1235325 |
End bp | 1236635 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640086947 |
Product | Rieske iron-sulfur protein 2Fe-2S subunit |
Protein accession | YP_001017449 |
Protein GI | 124023142 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACAGTTG AGTCTCCAAC CGAGAAAGCC ATAACTCCTT CAGTTGGCCT GCTGGGTTGG TATTCTGTGT GCGCCAGTCA GTTATTAAAG CAAAATGAAT TATATCATCT GTCCATGTAT AATGAGCCAT TAGTTATATA CAGAGACAAG GAAAATAAAC CGAGATGTAT CAAAGATTCG TGCCCTCATC GATCGGCATC ATTTCGTGGG GGAGAAAGTA AAAATGGCGA AATCATCTGC CCATATCATG GAGCTCGCTA TACGACATCC TGCAATCAGG ATGGATTTGA TAGAATAACT TGCAACCATA TTGTTGATTC TGATTATGAT AATTTTGCAA AATATTTACA CCTGAGGCAA TATCCATGTG TAGAACAAGG TGATTATATA TATATTTATT ATACAGGGGA AGCAAAGACA AGTCCAAATG ATTTCAAGAT AAACTCTGAG CTAGAACCAA GTCTGCCAGA GACGTATGGA TTTGACTTAG CAGATTCAAA ATTTGAAGAA GTATTTATAG ACTTTAAATG CGATTGGTCT CGCATCATAG AAAACCATTT AGACATCCTA CATATATTTT GGCTGCATGG TAATACTCTG CCTGGAAATG ATGTCAACAG AGAAACAATT AAAAGCTTCA ATCAAACAAT CAATAAGGAT CAATATCATC TTCGAAGTGT CTACAATGAG AAAGGAAATA AAAAGGAGGA GTTTATTTCT CAAATCTTCA TCCCTCCTGG CCGTGTGATT ATATTCAAGG GCTCGCCTGA GCAGGCAAGA TATGTACAAG TTTTAGATCA TATTCCTTTG GCTCACAATC GAGCACGAAT CATTGTTCGT CATTATAGAA AGTTTCTTAA GAATAAGTTT CTGTGCAAAT TACTTCTCTT TAAGCAAAGG CAGCAACAAG TATTTTACAA AATTTTCTCG GAAGATTATC TCGTCTTACA AACGCAAACC TTTAATGAAC AGATGGGCTA CATGCACCAA GGGCAAAACA AACTTTTAGC AGAAGACAAG ATGATTAAAC ATTTTTGGGA TTGGCATCAA CAATCCATTG AGAAAGAAAG CCCATGGACT ATACACCCTA CATCGGCACA TACAAATACA ATTCATCAAG ATATGTTGAT GGTATACCCT CCCGCAAACC CACAGTTATC TCATGATGTT CAACGTATAA TCGATCGTAA AGTGGCCGTT CGTCTATTCT CTATAGTTAT AATCATACTA GCTTTCATTT TTGCGCCAAA CTTAGTTCAA CAAATCAAGT CTGGAAATGA TTCAATACCT ATGGTTGAAA CTCAAGAATA A
|
Protein sequence | MTVESPTEKA ITPSVGLLGW YSVCASQLLK QNELYHLSMY NEPLVIYRDK ENKPRCIKDS CPHRSASFRG GESKNGEIIC PYHGARYTTS CNQDGFDRIT CNHIVDSDYD NFAKYLHLRQ YPCVEQGDYI YIYYTGEAKT SPNDFKINSE LEPSLPETYG FDLADSKFEE VFIDFKCDWS RIIENHLDIL HIFWLHGNTL PGNDVNRETI KSFNQTINKD QYHLRSVYNE KGNKKEEFIS QIFIPPGRVI IFKGSPEQAR YVQVLDHIPL AHNRARIIVR HYRKFLKNKF LCKLLLFKQR QQQVFYKIFS EDYLVLQTQT FNEQMGYMHQ GQNKLLAEDK MIKHFWDWHQ QSIEKESPWT IHPTSAHTNT IHQDMLMVYP PANPQLSHDV QRIIDRKVAV RLFSIVIIIL AFIFAPNLVQ QIKSGNDSIP MVETQE
|
| |