Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_09481 |
Symbol | hcaE |
ID | 5731058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 844891 |
End bp | 846237 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641285315 |
Product | Rieske iron-sulfur protein 2Fe-2S subunit |
Protein accession | YP_001550833 |
Protein GI | 159903489 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.056748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000260646 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGAACG CTATGGAAGA TGACAGGTTT CAACAATTTA ATAAAGTTAA GTTGAGCAAT GGTGCTTCCA TAACAAAGAA TTTGCTCGAA AATACTAGTA ATGAGAAAAA GACTCATAAA TCTAAACAAC CTACCAATCA GTTAAAGAGT GGTCTTCTTG GTTGGTATGC AGTTTGTAGT ATTCGAGAAA TAAGTGGGGA TGATCCTTAT TTCTTTACTA TGTTCAATGA ACCGTTGATG ATATATAAAG ATAAAGATTC AAATTTAAGG TGTATAAAAG ATCTATGCCC ACACAGAGGG GCCTCGTTCC GAGGAGGCCA AATTATAGAC GGAGAACTTG TTTGTCCTTA CCATGGCGCG AAATTCTCAT CTACAGGAAA GTGCACCAAT TTAAGTAGGA TAACCTGCAA TCATATAGTT GACAGCAATT ACAATAATTA TGCAACTAAG ATACATCTCT ATCAGTACTT ATGCAAAGAA GTAGGAGACT ATATATTTAT AAATTATACT GGTAGTTCAT CTACAAACCT TGAAGAAATA GAAGTTAAAG AAAATATAGA TTCAAAAATA CTTAACACCT ATGGATTTAA AACAGAGGAG TATAAATTTG AAGAAGTTAT AGTAGATTTT AAATGTGATT GGGCTAGAAT AGTTGAGAAT CACCTAGACA TACTTCACTT ATTTTGGGTT CATGGAGAAA CTATACCTGA TGCCGATGTT AACAGAAATG TTATTACAAG TTTCAATCAA GAAATAACAC GAGACAGCAA TCAAATAGAA AGCAAGTATA AATACAAGGA AAAGGATAAA GGTGAATTTA TAAGGATCAA ATTTCTTCCA CCTGGTCGGA TTATAATTTA CAAGGGTAAT CCAGAGGAAT CACGGTATAT TCAAGTTCTG GATCATATTC CCCTTGCTAA GAATCAAGCA AGAGTTATAG TTCGGCATTA CAGGAAGTTC CTTAAAAATA ATTTCTTTAA TAGTTTAATT CTATTTAAAA ATCTTCAGCA TAGAATATTC TATAAGGTTT TTGCCGAAGA CTATATGATT CTAAGGACTC AAACATTTAA TGACCAAATG GGATATATAG AAAAGGACAA TGTTAAGCTT TTAGGAGAAG ATAAAATGAT CCAATATTAT TGGGATTGGT ACAAAAACTC CCTAAATGAG GATAAGCCAT GGGACATACA TCCAATTAAA AGTGACACAA ATAGTGTCCA TCAAGAATTA GCTATGTTAT ACCCTCCTGA GAATAAAATA CTAGCAGAAA AAAACAACAG GGAAATTGTG GTTAAGTTAA TAGCTAGGCT AATTATTCCT ATAGGGCTCG CATTCTTATT AATCTAA
|
Protein sequence | MQNAMEDDRF QQFNKVKLSN GASITKNLLE NTSNEKKTHK SKQPTNQLKS GLLGWYAVCS IREISGDDPY FFTMFNEPLM IYKDKDSNLR CIKDLCPHRG ASFRGGQIID GELVCPYHGA KFSSTGKCTN LSRITCNHIV DSNYNNYATK IHLYQYLCKE VGDYIFINYT GSSSTNLEEI EVKENIDSKI LNTYGFKTEE YKFEEVIVDF KCDWARIVEN HLDILHLFWV HGETIPDADV NRNVITSFNQ EITRDSNQIE SKYKYKEKDK GEFIRIKFLP PGRIIIYKGN PEESRYIQVL DHIPLAKNQA RVIVRHYRKF LKNNFFNSLI LFKNLQHRIF YKVFAEDYMI LRTQTFNDQM GYIEKDNVKL LGEDKMIQYY WDWYKNSLNE DKPWDIHPIK SDTNSVHQEL AMLYPPENKI LAEKNNREIV VKLIARLIIP IGLAFLLI
|
| |