Gene P9211_09481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_09481 
SymbolhcaE 
ID5731058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp844891 
End bp846237 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content33% 
IMG OID641285315 
ProductRieske iron-sulfur protein 2Fe-2S subunit 
Protein accessionYP_001550833 
Protein GI159903489 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.056748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000260646 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGAACG CTATGGAAGA TGACAGGTTT CAACAATTTA ATAAAGTTAA GTTGAGCAAT 
GGTGCTTCCA TAACAAAGAA TTTGCTCGAA AATACTAGTA ATGAGAAAAA GACTCATAAA
TCTAAACAAC CTACCAATCA GTTAAAGAGT GGTCTTCTTG GTTGGTATGC AGTTTGTAGT
ATTCGAGAAA TAAGTGGGGA TGATCCTTAT TTCTTTACTA TGTTCAATGA ACCGTTGATG
ATATATAAAG ATAAAGATTC AAATTTAAGG TGTATAAAAG ATCTATGCCC ACACAGAGGG
GCCTCGTTCC GAGGAGGCCA AATTATAGAC GGAGAACTTG TTTGTCCTTA CCATGGCGCG
AAATTCTCAT CTACAGGAAA GTGCACCAAT TTAAGTAGGA TAACCTGCAA TCATATAGTT
GACAGCAATT ACAATAATTA TGCAACTAAG ATACATCTCT ATCAGTACTT ATGCAAAGAA
GTAGGAGACT ATATATTTAT AAATTATACT GGTAGTTCAT CTACAAACCT TGAAGAAATA
GAAGTTAAAG AAAATATAGA TTCAAAAATA CTTAACACCT ATGGATTTAA AACAGAGGAG
TATAAATTTG AAGAAGTTAT AGTAGATTTT AAATGTGATT GGGCTAGAAT AGTTGAGAAT
CACCTAGACA TACTTCACTT ATTTTGGGTT CATGGAGAAA CTATACCTGA TGCCGATGTT
AACAGAAATG TTATTACAAG TTTCAATCAA GAAATAACAC GAGACAGCAA TCAAATAGAA
AGCAAGTATA AATACAAGGA AAAGGATAAA GGTGAATTTA TAAGGATCAA ATTTCTTCCA
CCTGGTCGGA TTATAATTTA CAAGGGTAAT CCAGAGGAAT CACGGTATAT TCAAGTTCTG
GATCATATTC CCCTTGCTAA GAATCAAGCA AGAGTTATAG TTCGGCATTA CAGGAAGTTC
CTTAAAAATA ATTTCTTTAA TAGTTTAATT CTATTTAAAA ATCTTCAGCA TAGAATATTC
TATAAGGTTT TTGCCGAAGA CTATATGATT CTAAGGACTC AAACATTTAA TGACCAAATG
GGATATATAG AAAAGGACAA TGTTAAGCTT TTAGGAGAAG ATAAAATGAT CCAATATTAT
TGGGATTGGT ACAAAAACTC CCTAAATGAG GATAAGCCAT GGGACATACA TCCAATTAAA
AGTGACACAA ATAGTGTCCA TCAAGAATTA GCTATGTTAT ACCCTCCTGA GAATAAAATA
CTAGCAGAAA AAAACAACAG GGAAATTGTG GTTAAGTTAA TAGCTAGGCT AATTATTCCT
ATAGGGCTCG CATTCTTATT AATCTAA
 
Protein sequence
MQNAMEDDRF QQFNKVKLSN GASITKNLLE NTSNEKKTHK SKQPTNQLKS GLLGWYAVCS 
IREISGDDPY FFTMFNEPLM IYKDKDSNLR CIKDLCPHRG ASFRGGQIID GELVCPYHGA
KFSSTGKCTN LSRITCNHIV DSNYNNYATK IHLYQYLCKE VGDYIFINYT GSSSTNLEEI
EVKENIDSKI LNTYGFKTEE YKFEEVIVDF KCDWARIVEN HLDILHLFWV HGETIPDADV
NRNVITSFNQ EITRDSNQIE SKYKYKEKDK GEFIRIKFLP PGRIIIYKGN PEESRYIQVL
DHIPLAKNQA RVIVRHYRKF LKNNFFNSLI LFKNLQHRIF YKVFAEDYMI LRTQTFNDQM
GYIEKDNVKL LGEDKMIQYY WDWYKNSLNE DKPWDIHPIK SDTNSVHQEL AMLYPPENKI
LAEKNNREIV VKLIARLIIP IGLAFLLI