Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_08501 |
Symbol | hcaE |
ID | 4780198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 780659 |
End bp | 781993 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640084125 |
Product | Rieske iron-sulfur protein 2Fe-2S subunit |
Protein accession | YP_001014673 |
Protein GI | 124025557 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00185666 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACGAAG AAAATGGAAA AGACAATAAA TCATTTGAAT ATGAGACTAA TAACCTCATT AGATCTAATT TTAATGAGAA AGAAAGTGTA AAAGAACTAG AGAATATTGC TCCCAAGCCA TCTAATCAAC TAACAAATGG ACTATTAGGT TGGTATTCAG TATCGAGTAG CGAGTCAATA AAGGAGGGTA AATTAAATCA CTTCACAATT TACAATGAGC CGCTTGTTTT GTATCGTGAT AGAGAAGGTA TTGTTAGATG TGTAAAAGAT GTCTGTCCTC ATAGAGGAGC TTCATTTCTA GGCGGTGAAG TGATAAACGG ACAACTTGTT TGCCCTTATC ACGGTGCAAG GTTCTCATCT CAAGGAAGTT GCACAAATTT AGATAGAATA ACCTGCCAGC ATATTATTGA TTCTAATTAC GATAACTATG CAAAAAGCAT AAAACTTTTT CAATATCCAT GCGTAGAGAA AGAAGGATAT ATTTATATTT ATTATACAGG TACACCTCTA GCAAACATTG AAGACTTTCA GATAAAATCT TCAATCAATA GCCTTCTGCC TGACTCCTAT GGATTTCCAT CTTTAGAATA TGAATATGAA GAAGTTTATG TAGATTTCAA AGCAGACTGG GCAAGAATTA TAGAAAATCA TCTAGATATA TTGCATGTAT TCTGGATGCA CGGAGACACA ATCCCCGACA AGAACGTAAA CAGAGAAACA ATAACAAGCT TCAATCAAAA AATAAAAAGA GATAATAGGC AAATAGAAAG TATATATTCA TATAAAACAA ATGGGCAAGA AGAGTTTATT AGAATAAAAT TCGTACCTCC GGGAAGAATT TTTATATATA AAGGCTCACC TGAAAGTACA AGATATATTC AAGTTCTAGA TCATATTCCA CTAGGAAATA ATAAAGCAAG AGTAATTGTA AGGCATTATA GAAAATTCCT TAAAAATAAA TTTTTTACAA ACCTAGTTTT ATTTAGCCAT CTACAAAGAA GAACATTTTA TAAGATTTTC ACTGAAGATT ATTTAGTCTT AAAAACTCAA ACATTTAATG ATCAAATGGG CTACATACAA AAAGATAATG TAAAATTATT AGGAGAAGAT AAAATGGTTC AATATTACTG GGATTGGCTT CAAAATGCTT TAAATAAAGA AAAACCATGG GACTTACATC CAACCAATTC ATTGACTAAT TCAGTTCATG AGGATAGAGG AATGCAATAT CCTCCAGAAA ATCCTAATAT GGCCATAAAG AATAATAGAA AGATAATTAT AAAACTTTTA ACTAGATTAT TATTCCCAAT TAGTTTTATT CTACTATTAA TATAA
|
Protein sequence | MNEENGKDNK SFEYETNNLI RSNFNEKESV KELENIAPKP SNQLTNGLLG WYSVSSSESI KEGKLNHFTI YNEPLVLYRD REGIVRCVKD VCPHRGASFL GGEVINGQLV CPYHGARFSS QGSCTNLDRI TCQHIIDSNY DNYAKSIKLF QYPCVEKEGY IYIYYTGTPL ANIEDFQIKS SINSLLPDSY GFPSLEYEYE EVYVDFKADW ARIIENHLDI LHVFWMHGDT IPDKNVNRET ITSFNQKIKR DNRQIESIYS YKTNGQEEFI RIKFVPPGRI FIYKGSPEST RYIQVLDHIP LGNNKARVIV RHYRKFLKNK FFTNLVLFSH LQRRTFYKIF TEDYLVLKTQ TFNDQMGYIQ KDNVKLLGED KMVQYYWDWL QNALNKEKPW DLHPTNSLTN SVHEDRGMQY PPENPNMAIK NNRKIIIKLL TRLLFPISFI LLLI
|
| |