Gene NATL1_08501 details

Gene Information

Locus tag: NATL1_08501
Symbol: hcaE
ID: 4780198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 780659
End bp: 781993
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 31%
IMG OID: 640084125
Product: Rieske iron-sulfur protein 2Fe-2S subunit
Protein accession: YP_001014673
Protein GI: 124025557
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
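
The coordinate and length fields above are consistent with each other, and the check can be sketched in a few lines. This is an illustrative sketch only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three.
    If the CDS includes the stop codon, that codon adds no residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length_bp(780659, 781993)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed gene length of 1335 bp and protein length of 444 aa.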


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00185666
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACGAAG AAAATGGAAA AGACAATAAA TCATTTGAAT ATGAGACTAA TAACCTCATT 
AGATCTAATT TTAATGAGAA AGAAAGTGTA AAAGAACTAG AGAATATTGC TCCCAAGCCA
TCTAATCAAC TAACAAATGG ACTATTAGGT TGGTATTCAG TATCGAGTAG CGAGTCAATA
AAGGAGGGTA AATTAAATCA CTTCACAATT TACAATGAGC CGCTTGTTTT GTATCGTGAT
AGAGAAGGTA TTGTTAGATG TGTAAAAGAT GTCTGTCCTC ATAGAGGAGC TTCATTTCTA
GGCGGTGAAG TGATAAACGG ACAACTTGTT TGCCCTTATC ACGGTGCAAG GTTCTCATCT
CAAGGAAGTT GCACAAATTT AGATAGAATA ACCTGCCAGC ATATTATTGA TTCTAATTAC
GATAACTATG CAAAAAGCAT AAAACTTTTT CAATATCCAT GCGTAGAGAA AGAAGGATAT
ATTTATATTT ATTATACAGG TACACCTCTA GCAAACATTG AAGACTTTCA GATAAAATCT
TCAATCAATA GCCTTCTGCC TGACTCCTAT GGATTTCCAT CTTTAGAATA TGAATATGAA
GAAGTTTATG TAGATTTCAA AGCAGACTGG GCAAGAATTA TAGAAAATCA TCTAGATATA
TTGCATGTAT TCTGGATGCA CGGAGACACA ATCCCCGACA AGAACGTAAA CAGAGAAACA
ATAACAAGCT TCAATCAAAA AATAAAAAGA GATAATAGGC AAATAGAAAG TATATATTCA
TATAAAACAA ATGGGCAAGA AGAGTTTATT AGAATAAAAT TCGTACCTCC GGGAAGAATT
TTTATATATA AAGGCTCACC TGAAAGTACA AGATATATTC AAGTTCTAGA TCATATTCCA
CTAGGAAATA ATAAAGCAAG AGTAATTGTA AGGCATTATA GAAAATTCCT TAAAAATAAA
TTTTTTACAA ACCTAGTTTT ATTTAGCCAT CTACAAAGAA GAACATTTTA TAAGATTTTC
ACTGAAGATT ATTTAGTCTT AAAAACTCAA ACATTTAATG ATCAAATGGG CTACATACAA
AAAGATAATG TAAAATTATT AGGAGAAGAT AAAATGGTTC AATATTACTG GGATTGGCTT
CAAAATGCTT TAAATAAAGA AAAACCATGG GACTTACATC CAACCAATTC ATTGACTAAT
TCAGTTCATG AGGATAGAGG AATGCAATAT CCTCCAGAAA ATCCTAATAT GGCCATAAAG
AATAATAGAA AGATAATTAT AAAACTTTTA ACTAGATTAT TATTCCCAAT TAGTTTTATT
CTACTATTAA TATAA
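
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before computing anything from it. The following is a minimal sketch of how a GC percentage such as the one listed in the gene information could be computed. It assumes GC content means the share of G and C bases over the full sequence length; the helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence (assumed definition of GC content)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, copied with its block spacing.
first_row = "ATGAACGAAG AAAATGGAAA AGACAATAAA TCATTTGAAT ATGAGACTAA TAACCTCATT"
print(len(clean_sequence(first_row)))
print(round(gc_content_percent(first_row), 1))
```

Passing the whole gene sequence instead of the first row gives the value for the full gene.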
 
Protein sequence
MNEENGKDNK SFEYETNNLI RSNFNEKESV KELENIAPKP SNQLTNGLLG WYSVSSSESI 
KEGKLNHFTI YNEPLVLYRD REGIVRCVKD VCPHRGASFL GGEVINGQLV CPYHGARFSS
QGSCTNLDRI TCQHIIDSNY DNYAKSIKLF QYPCVEKEGY IYIYYTGTPL ANIEDFQIKS
SINSLLPDSY GFPSLEYEYE EVYVDFKADW ARIIENHLDI LHVFWMHGDT IPDKNVNRET
ITSFNQKIKR DNRQIESIYS YKTNGQEEFI RIKFVPPGRI FIYKGSPEST RYIQVLDHIP
LGNNKARVIV RHYRKFLKNK FFTNLVLFSH LQRRTFYKIF TEDYLVLKTQ TFNDQMGYIQ
KDNVKLLGED KMVQYYWDWL QNALNKEKPW DLHPTNSLTN SVHEDRGMQY PPENPNMAIK
NNRKIIIKLL TRLLFPISFI LLLI
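
The protein sequence corresponds to the gene sequence translated codon by codon. A minimal sketch of that translation follows. It assumes the amino acid assignments of translation table 11 (listed in the gene information) match the standard genetic code, with the tables differing only in permitted start codons; since this gene starts with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are illustrative, not taken from any library.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str, to_stop: bool = True) -> str:
    """Translate a block-formatted DNA sequence in reading frame 1.

    Whitespace is stripped first. With to_stop=True, translation ends at
    the first stop codon and the stop is not included in the result.
    """
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence above.
print(translate("ATGAACGAAG AAAATGGAAA AGACAATAAA TCATTTGAAT ATGAGACTAA TAACCTCATT"))
print(translate("CTACTATTAA TATAA"))
```

The first row of the gene yields the first twenty residues of the protein (MNEENGKDNK SFEYETNNLI), and the last row yields its final four residues (LLLI) followed by the TAA stop codon.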