Gene Noc_3046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_3046 
Symbol 
ID3704345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3440857 
End bp3442455 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content49% 
IMG OID637739520 
Productcytochrome c oxidase 
Protein accessionYP_345017 
Protein GI77166492 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000491174 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAG TAGCAGCACA CGGTGATCAC GCGCATCACC CTACCGGCAT CATGCGCTGG 
TTGACAACCA CCAACCATAA GGATATTGGT ACCTTATACC TGTTTTTTAG CCTTACTATG
TTTTTTGTTG GCGGCGCGAT GGCGCTGACA TTCCGTACCG AACTTTTTGC GCCTGGATTA
CAAATTCTAG ATCCCCAAAG ATTTAATGAG TTGGTGACCC TGCATGGGTT AGTCATGATC
TTTGGCGCGA TGATGCCCGT TCTAGCGGGT TTAGCTAACT GGCAAATACC GCTTATGATT
GGCGCGCCTG ATATGGCTTT GCCGCGGTTA AATAACTGGA GCTTCTGGCT CTTACCCTTT
GCCATGCTTT TGCTTCTTAG CAGTTTGCTG GTGCCGGGCG GAGCAGCGGC TGGGGGATGG
ACCATGTACC CACCCTTGTT TATCCAGGGC GGGGTTGGCA TTGATATGAC CATCTTTTCT
GTCCATCTTC TGGGACTTTC TTCCATATTG GCGTCGATCA ATATTATTGT TACCGTCCTA
AACATGCGAG CACCTGGCAT GGGTCTGATG AAAATGCCTA TGTTCGTCTG GGGATGGTTG
ATCACTGCCT TTTTGCTGGT TGCGGTGGCT CCGGTACTTG CGGGCGCCGT GACCATGGAG
CTTACCGACC GTCATTTTGG CACCAGTTTC TTTAATGCGG CTGGCGGCGG TGACCCGGTG
ATGTACCAGC ACATTTTCTG GTTTTTTGGC CATCCCGAAG TCTATATTAT GGTTTTACCT
ATTTTCGGGG TGATATCGGA TATTATTCCG ACTTTTGCCC GTAAGCCAAT ATTTGGCTAT
CACTCCATGG TCTACGCTTT AGCTTCGATT GCCTTCCTCT CCTTCATCGT GTGGGCGCAT
CACATGTTTA CCGTCGGCAT GCCGCTTTCA GGAGAGTTGT ACTTTATGTA TGCAACCGTC
CTGATTTCCG TTCCCACTGG GATCAAAATT TTTAATTGGC TTACCACCAT GTGGCGGGGT
TCCATGACTT TTGAGTTGCC CATGCTGTGG TCTATGGCCT TCATCGCTTT ATTTACTATT
GGCGGCCTGA CTGGCCTTAT GATGGGCGTA GCCGCGGCGG ATTTTCAGTA CCATGATACC
TATTTTATTG TTTCCCACTT CCACTATGTA TTTCTGCCGG TGACGCTATT TGGTACCTAT
GCTGCTGTTT ACTACTGGCT ACCTAAATGG ACTGGTAATT GGTATGACGC GCGTCTAGGG
AAATGGCATT TCTGGCTGTC CGTAATTTCA ATGAATATCG TTTTCTTTCC GCAGAATTTC
CTTGGCTTGG CGGGCATGCC GCGGCGAATT CCTGACTACG CCATTCAGTT CGCTGAATTC
AATGCGATTT CCACCATAGG TGCTTTCATT TTCGGTTTCT CTCAGTTGAT CTTTGTATAT
GTGATTATTA AGGCTATTCG TGGTGGCGCA GGTGTGGAAA AAGCTACCGA CCAGGTATGG
GAAGGCGCAA AGGGTTTAGA GTGGACACTT AGCTCTCCGC CCCCTTACCA TAGTTTCACA
ACTCCACCCC AAGTCACGGC GGAGAATAAT CCCCATTAA
 
Protein sequence
MSTVAAHGDH AHHPTGIMRW LTTTNHKDIG TLYLFFSLTM FFVGGAMALT FRTELFAPGL 
QILDPQRFNE LVTLHGLVMI FGAMMPVLAG LANWQIPLMI GAPDMALPRL NNWSFWLLPF
AMLLLLSSLL VPGGAAAGGW TMYPPLFIQG GVGIDMTIFS VHLLGLSSIL ASINIIVTVL
NMRAPGMGLM KMPMFVWGWL ITAFLLVAVA PVLAGAVTME LTDRHFGTSF FNAAGGGDPV
MYQHIFWFFG HPEVYIMVLP IFGVISDIIP TFARKPIFGY HSMVYALASI AFLSFIVWAH
HMFTVGMPLS GELYFMYATV LISVPTGIKI FNWLTTMWRG SMTFELPMLW SMAFIALFTI
GGLTGLMMGV AAADFQYHDT YFIVSHFHYV FLPVTLFGTY AAVYYWLPKW TGNWYDARLG
KWHFWLSVIS MNIVFFPQNF LGLAGMPRRI PDYAIQFAEF NAISTIGAFI FGFSQLIFVY
VIIKAIRGGA GVEKATDQVW EGAKGLEWTL SSPPPYHSFT TPPQVTAENN PH