Gene P9515_07721 details


Gene Information

Locus tag: P9515_07721
Symbol: hcaE
ID: 4719437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9515
Kingdom: Bacteria
Replicon accession: NC_008817
Strand:
Start bp: 699308
End bp: 700627
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 26%
IMG OID: 640080451
Product: Rieske iron-sulfur protein 2Fe-2S subunit
Protein accession: YP_001011088
Protein GI: 123966007
COG category: [P] Inorganic ion transport and metabolism
              [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
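
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `lengths_from_coordinates` is hypothetical and not part of the source database.

```python
def lengths_from_coordinates(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one stop codon at the
    3' end that does not contribute a residue to the protein.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_len, protein_len = lengths_from_coordinates(699308, 700627)
print(gene_len, protein_len)  # 1320 439
```

Under these assumptions the result agrees with the Gene Length (1320 bp) and Protein Length (439 aa) fields.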


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.725202
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAATA AGCAAATTAA TTTTTTTAAA GCTAAAGATA TTAATACCGT TTTAAAACCT 
TATAAAAAGG GTACTGTAGT TAATATCGAT AACTTTGAGG CGAGAGAAAG ACAAAAAGAA
TTAAAGACAG GTTTATATGG ATGGTATGCA ATTTGCCCTT CAAATGTACT TAAGAAAAAT
AAGATTCATT ATTTCTCCTT ATTTGATGAA CCTCTTCTTC TATTTAGAGA CAATAAAAAT
AATGTAAGAT GTATCAAAAA TATTTGTCCC CATAGGGGGG CCTCTTTTTA TGGTGGATCA
ATTTCTAATG GAGAATTAAC TTGCCCATAC CATGGAGCAA GATTCAGTTC TGAAGGTAAT
TGTCAGAATA TTGATAGCAT AACTTGTAGA CATATTGTTG ATAATAATTA TGATAATTAT
GCTAAGAGGA TACACTTATC TCAATACAAA ACAGTAGAAG AGGATAACTA CATTTTTATA
TATTTTTCCG ATAAGTCTGA GATGGATTTG AATAATATTA AAGAAGAGCC ATCAATTAGT
AATTATGAAT TAATTAGTAA CGGATTTTCT ATTGAAGACT CAGTATCAGA AGAGGTTTTA
GTTGATTTTA AATGTGATTG GTCAAGAATA ATTGAAAATC ATTTAGATAT TCTTCACATA
TTTTGGGTAC ATGGAGATAC AATTCCAGAT AAAGAAGTAA ATAAAAATGT TTTGGTAAGT
TTTAATCAGA AAATAAATAT CAATCCAAAT TATATTGAAA GCATATATTT CTATAAAAAA
AATCCAACAA AAGAATTTAT AAGAATAAAG TATATTCCTC CAGGAAGAAT TTTAATTTAT
AAGGGAGATC CAGCTGTTTC AAGGTATGTT CAAGTGCTTG ATCATATCCC CTTAGGCGAA
AATAAAGCAA GAGTAATTGT TAGACATTAC AGGAAATTTC TTAAAAACAA ATTACTTAAT
AACCTAATAT TGTTTAAAGA AAATCAAAAA AAGATTTTTT ATAAAATTTT TAATGAAGAT
TACATGATTC TTAAAACCCA GACTTACAAT CATAAGATGG GATTAATAAA AAATGATGAG
ATAAAACTAC TTGGTGAAGA TAGAATGATT AATTATTTCT GGAATTGGTA TAAAAAATCG
GAAGAAAAGG ATACTCCATG GAAATATATA AATAATAAAG AACTTAACGT TTATGATGAA
ATTATATTTA AATATCCTCC CGAAATTAAG AAATTAGAAG TAATTAATAA TATCAATATA
ATAAGAAAAG CATTTATAAG ATATGCTGCC CCACTTATCT TTTTACTGCT AATAATATAA
 
Protein sequence
MENKQINFFK AKDINTVLKP YKKGTVVNID NFEARERQKE LKTGLYGWYA ICPSNVLKKN 
KIHYFSLFDE PLLLFRDNKN NVRCIKNICP HRGASFYGGS ISNGELTCPY HGARFSSEGN
CQNIDSITCR HIVDNNYDNY AKRIHLSQYK TVEEDNYIFI YFSDKSEMDL NNIKEEPSIS
NYELISNGFS IEDSVSEEVL VDFKCDWSRI IENHLDILHI FWVHGDTIPD KEVNKNVLVS
FNQKININPN YIESIYFYKK NPTKEFIRIK YIPPGRILIY KGDPAVSRYV QVLDHIPLGE
NKARVIVRHY RKFLKNKLLN NLILFKENQK KIFYKIFNED YMILKTQTYN HKMGLIKNDE
IKLLGEDRMI NYFWNWYKKS EEKDTPWKYI NNKELNVYDE IIFKYPPEIK KLEVINNINI
IRKAFIRYAA PLIFLLLII
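
The relationship between the two sequences can be illustrated with a short translation sketch in Python. It is an illustration only, not the method used by the source database. It assumes the gene sequence is given in coding orientation, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for every codon (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-character block layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above.
first_row = "ATGGAAAATA AGCAAATTAA TTTTTTTAAA GCTAAAGATA TTAATACCGT TTTAAAACCT"
print(translate(first_row))  # MENKQINFFKAKDINTVLKP
```

The printed residues match the first two blocks of the protein sequence (MENKQINFFK AKDINTVLKP). Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.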