Gene P9301_08691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_08691 
SymbolhcaE 
ID4911016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp752828 
End bp754150 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content28% 
IMG OID640160451 
ProductRieske iron-sulfur protein 2Fe-2S subunit 
Protein accessionYP_001091093 
Protein GI126696207 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACA GACAAATTAA TTTTTTTAAA TCAAAAGACT TTAATACCGT TCTTAAGCCA 
TTTAAAAAAG GAACGGTAGT AAAAATTGAC TCGTTTGATA TTAGAGAAAA TCAAAAAGAA
TTAAATATAG GTTTATTTGG TTGGTATGCA ATTTGTCCCT CTAAAGAACT AAAAAAAAAT
AAGCTTTATT ATTTTTCACT CTATGATGAG CCGCTTGTTC TTTATAGAGA TGAAAATAAA
AACGTAAGGT GTATTAAAAA TATTTGTCCA CACCGAGGAG CCTCCTTTTT TGGAGGAACA
TTATCAGGTG GAGTAATAAC CTGCCCATAT CATGGAGCTA AGTTTTCATC TGGAGGAAGT
TGCCAAAATC TCGACAGAAT AACATGTCGC CATATAGTTG ATAATAACTA CGATAACTAC
GCTAAAAGAA TTCATTTATC TCAATACAAA ACCTCAGAAA AAAATGGATA TATTTTTGTA
CATTTTTCTA AAAAATCTGA GACTGATTTA AATAACATAA ATGAAGATAC ACCTGTAAGT
AACTACGAAT TATATGAAAA TGGATTTGCA CATAAGGATT ATGTCTTTGA GGAGGTATTA
GTTGACTTTA AATGTGATTG GTCAAGGATT ATTGAAAATC ACCTAGATAT TCTTCATATC
TTTTGGGTTC ATGGCGATAC AATTCCTGAT AAAGATGTGA ATAAAAACGT ACTTGTTAGT
TTTAACCAGA AAATTAATAT TAATCCCAAA TACATTGAAA GTATTTATTA TTACAAGAAT
GACCCTACAA AAGAATTTAT TCGGATAAAA TACATACCTC CAGGAAGGAT ATTAATCTAC
AAAGGTGATC CTTCCTCATC AAGATATTTA CAAGTTTTAG ATCATATTCC TCTAGGAAAA
AACAAAGCAA GAGTAATAGT AAGACACTAT AGGAAATTTC TACAAAATAA ACTACTTAAT
AACCTCTTAT TATTTAAAGA GACTCAAAGA AAGATTTTTT ATAAGATATT TGATGAGGAT
TATATGATTT TAAAAACACA AACATATAAT CACGATATGG GATTTATTAG TAAGGATGAA
ATAAAATTAT TGGGAGAAGA TAGAATAATA AATTATTTTT GGAAGTGGTA CAAGAGGTCT
GAAGATAATG ATAAACCATG GAAAAATAAT AACAAAACCC AAAATCTTGA TGTATATGAC
AAAGTGATAT TGAAATATCC TCCTGAGATA AAGAAGTTAG AAATTGCAAA TAATATAGAT
ATTATTAGAA AAACAATCGT AAGATTTGCT GCTCCGCTTA TATTTTTCAT GTTAATAATA
TAA
 
Protein sequence
MENRQINFFK SKDFNTVLKP FKKGTVVKID SFDIRENQKE LNIGLFGWYA ICPSKELKKN 
KLYYFSLYDE PLVLYRDENK NVRCIKNICP HRGASFFGGT LSGGVITCPY HGAKFSSGGS
CQNLDRITCR HIVDNNYDNY AKRIHLSQYK TSEKNGYIFV HFSKKSETDL NNINEDTPVS
NYELYENGFA HKDYVFEEVL VDFKCDWSRI IENHLDILHI FWVHGDTIPD KDVNKNVLVS
FNQKININPK YIESIYYYKN DPTKEFIRIK YIPPGRILIY KGDPSSSRYL QVLDHIPLGK
NKARVIVRHY RKFLQNKLLN NLLLFKETQR KIFYKIFDED YMILKTQTYN HDMGFISKDE
IKLLGEDRII NYFWKWYKRS EDNDKPWKNN NKTQNLDVYD KVILKYPPEI KKLEIANNID
IIRKTIVRFA APLIFFMLII