Gene NATL1_04991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04991 
SymbolcyoB 
ID4780961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp453952 
End bp455592 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content39% 
IMG OID640083774 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_001014326 
Protein GI124025210 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.217716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATTT CACTCAAACC AAACAACTCT TCTCCAGAAA AACTTCAACC AACAGGATGG 
CTCAAATATC TGAGTTTTAG TCTTGATCAC AAAGTCATAG GACTTCAATA TTTAGTTTGT
GGTTTCTTGT TTTATTTAAT TGGAGGATCT TTAGCTGGAG CAATAAGAGT AGAACTGATC
AGTCCCCTCT CAGATTTTAT GCCTAGAGAG GTTTACAACC AGGTTTTAAC TCTCCACGGC
ACAATCATGA TTTTCTTATG GATCGTGCCT GTAGTAAATG GTGCCTTTGG AAACTACTTA
ATACCCTTTT ATGTTGGGGC TAGGGACATG GCTTTCCCAA GGCTGAATGC AGTTGCTTTT
TGGCTAATAC CTCCCTCTGG CTTAATGTTG ATAACAAGTT ATTTCATAAA CGGAGCCGCA
CAATCAGGGT GGACAGCTTA CCCACCTTTA AGCATTACGA CTCCAGCCGC TGGGCAAATC
ATTTGGATAT TAAGTGTCTT GCTTTTGGGT GGGAGCTCAA TTTTTGGTGG CATTAATTTC
ATCGCCACCA TCCTCAAGCT AAGGAGACCT GGTCTTAAAT TAATGCAATT ACCTATGTAT
TGCTGGGCAA TGCTTGGCAC GAGTATTCTT GTAGTTCTAT CAACTCCTGT TCTCGCTGGT
ACTCTAATAC TGCTGAGCTT TGACATAGTT GCTCATACCG GTTTCTTCAA CCCAAGTCTT
GGTGGGAATG TAATCGTTTA TCAACATCTT TTTTGGTTTT ACTCGCATCC TGCTGTTTAT
ATCATGGTCT TACCTGCCTT TGGTTTAGTC AGTGAAATAC TTCCAATCCA TAGTAGAAAA
CCACTTTTTG GCTATACAAC TATGGTCTTC TCAATTATGG GAATAGTTGT GTTGGGTCTA
GTTGTTTGGG CTCATCATAT GTTTACAAGC GGTACTCCTC CTTGGATGCG CTTATTTTTT
ACAATCGCTA CTGCATTTAT AGCTGTTCCC ACAGGTATTA AATTTTTTAA TTGGGTTGCG
ACCTTATGGG GAGGAAAAAT ATCACTTAAT GCTGCAATGT TATTTTCCTG CGGATTTATT
ATTAACTTTG TTTTAGGTGG AATAACGGGA GTTGCCCTAG CGCAAGTACC TTTTGATGTT
CATGTACACG ACACTTATTT TGTTGTTGCC CATTTTCATT ACATAGTTTA TGGCGGTTCT
GTCTTTGTTA TCTTCTCCTC GATTTATCAC TGGTTCCCAA AATTTACTGG AAAAATGCTC
AATGAAAATC TTGGAAGATT CCACTTTATA ATTACTTTTA TAGGCTTTAA TCTTTGTTTT
GCTCCTCAAC ATTGGCTAGG TTTAAACGGA ATGCCTCGAC GAGTTGCCGA ATACGATCCA
CAATTTCAGT TAATCAATCA AATTAGTAGT GTAGGTGCAT TATTAATGGC TTTAAGTACT
TTACCTTTCT TATGGAATAT TCTTCAAAGC ATCCTCTTTG GGGAAGAGGC TGGTGATAAC
CCTTGGAATG CACTAACTCC TGAGTGGTTA ACAAGTTCTC CTCCTCCTGT TGAGAATTGG
GACGGAGAAG CCCCACTAGT TCTTGAACCT TATGGTTATG GAGAAAAAGA TTCAAATGAG
ACTCAGGAGG CAATCAGATG A
 
Protein sequence
MTISLKPNNS SPEKLQPTGW LKYLSFSLDH KVIGLQYLVC GFLFYLIGGS LAGAIRVELI 
SPLSDFMPRE VYNQVLTLHG TIMIFLWIVP VVNGAFGNYL IPFYVGARDM AFPRLNAVAF
WLIPPSGLML ITSYFINGAA QSGWTAYPPL SITTPAAGQI IWILSVLLLG GSSIFGGINF
IATILKLRRP GLKLMQLPMY CWAMLGTSIL VVLSTPVLAG TLILLSFDIV AHTGFFNPSL
GGNVIVYQHL FWFYSHPAVY IMVLPAFGLV SEILPIHSRK PLFGYTTMVF SIMGIVVLGL
VVWAHHMFTS GTPPWMRLFF TIATAFIAVP TGIKFFNWVA TLWGGKISLN AAMLFSCGFI
INFVLGGITG VALAQVPFDV HVHDTYFVVA HFHYIVYGGS VFVIFSSIYH WFPKFTGKML
NENLGRFHFI ITFIGFNLCF APQHWLGLNG MPRRVAEYDP QFQLINQISS VGALLMALST
LPFLWNILQS ILFGEEAGDN PWNALTPEWL TSSPPPVENW DGEAPLVLEP YGYGEKDSNE
TQEAIR