Gene Noc_0114 details


Gene Information

Locus tag: Noc_0114
Symbol:
ID: 3705874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 119410
End bp: 120861
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 50%
IMG OID: 637736630
Product: di-haem cytochrome c peroxidase
Protein accession: YP_342177
Protein GI: 77163652
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
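
The coordinate and length fields in this record agree with each other, and the arithmetic can be checked directly. The following is a minimal sketch, assuming inclusive 1-based coordinates and a protein length that excludes the stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming a single trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(119410, 120861)
print(length, protein_length(length))  # prints: 1452 483
```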


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAAAGA AAACTATCTT GCCTTTCTGC TTATTATTAG CAGGTATACC CCTCTATGCA 
GTGGCCAATC AGGAGCCATC TATCCCTACA AACGGCCCCA ATTCTCACGG CCATTTAACC
TTATTTTCAT ATTTGTCAGC AAATAACCTA GCCTACTCTG GTGCTAATTA TCGTTTGCCT
CTCCCTGCGA CTGACTCTGA TTTTTACGAT CATGGTAGGC CTGATCCTGC GAAGTTCCAG
TTAGGTAAAT TCCTTTTTTT CGACAAGATC TTAAGCGGCA ATCAAAATAT CTCCTGCGCC
ACCTGCCATC ACCCCTTGGC TGCTACCGGC GATGGTCTTT CCTTGTCCGT GGGGGAAGGG
GGACGAGGAC TAGGAATTAC CCGAAATACG GGTTTCGGGA ATGAGGCTAT TTACGAGCGC
GTTCCCCGGA ATGCACCTCC GCTTTTTAAT GTAGGCGCCA AACACATGAC CGTGATGTTT
TACGATGGCC GGGCAGAAGT AGATTCCTTA GCACCCAGCG GGTTCAATAC CCCGGCAGGG
GATGAGTTAC CTATTGGAGT TCTAGAAAAT GTGTTGGCTG CTCAAGCTAT GTTTCCAGTC
ACTTCCAATT CAGAAATGGC AGGGCAACCA GGGGAAAATC CTATTGCCGA TGCGGCAGAA
GCTGGCAATT TAGCAGGACC CGGGGGTGTT TGGGAACAGC TCGCGAATCG GTTAAAAGCT
AACGAAGAGT ATGCTCAACT GTTTAACCAA GCTTTTGCCC TGACCCCTGA GCAAATTACC
TATGCTCATG CCGCCAACGC TATTGCGGCT TTTGAGGCAG CGGCATGGCG AGCGGATCAT
AGCCCTTTTG ATCGCTACCT GCGCGGCGAT AAAGAGGCAA TGAGCAGAGA ATCCATTCAA
GGAATGAGCC TCTTCTATGG GAAGGCAGGT TGCTCTCAAT GCCACAGCGG AGTTTTTCAA
ACCGATATGC AATATCATGC CATTGCAATG CCCCAGATTG GTCCGGGCAA AGGAGATGGC
GCTGAAGGTC ATGAGGATTT TGGACGGGAG CGGGTTACTC ACGATCCCGC TGATCGCTAT
AAGTTTCGCA CCCCCCCGCT TCGAAATGTG GCGCTTACAG GCCCCTGGGG GCATGATGGC
ACCTACAATA CCTTGGAAGC TGTGGTTCGC CATCACCTAG ATCCAGTGAG TTCCCTTCTC
ACTTACTCAT GCCTAAGGGA GGCCCAACTA CCTTATCGTG AGGATTTAGA TTTTTTCGAT
TGCCTAGTAC AAAACGATAG CGCTAAGGTT AACCTGATTG CTGCTGCTAA CACACTCCCC
ACAAGGGAGT TAAGCGATGA TGAGGTAAAG CATCTGCTGG CATTCCTTCA TGCTTTAACG
GATCCTATCA GTCTCGATTT GCGCGGTGAT ATTCCAGACA GAGTACCTAG CAATCTGACG
CTTGTGGAGT AA
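
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any length or GC calculation. The following is a minimal sketch of that cleanup and of a GC percentage; the helper names are hypothetical, and only the first display line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block display; uppercase the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGCGAAAGA AAACTATCTT GCCTTTCTGC TTATTATTAG CAGGTATACC CCTCTATGCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full 1452 bp sequence, the same function should give a value close to the 50% GC content reported in the gene information.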
 
Protein sequence
MRKKTILPFC LLLAGIPLYA VANQEPSIPT NGPNSHGHLT LFSYLSANNL AYSGANYRLP 
LPATDSDFYD HGRPDPAKFQ LGKFLFFDKI LSGNQNISCA TCHHPLAATG DGLSLSVGEG
GRGLGITRNT GFGNEAIYER VPRNAPPLFN VGAKHMTVMF YDGRAEVDSL APSGFNTPAG
DELPIGVLEN VLAAQAMFPV TSNSEMAGQP GENPIADAAE AGNLAGPGGV WEQLANRLKA
NEEYAQLFNQ AFALTPEQIT YAHAANAIAA FEAAAWRADH SPFDRYLRGD KEAMSRESIQ
GMSLFYGKAG CSQCHSGVFQ TDMQYHAIAM PQIGPGKGDG AEGHEDFGRE RVTHDPADRY
KFRTPPLRNV ALTGPWGHDG TYNTLEAVVR HHLDPVSSLL TYSCLREAQL PYREDLDFFD
CLVQNDSAKV NLIAAANTLP TRELSDDEVK HLLAFLHALT DPISLDLRGD IPDRVPSNLT
LVE
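
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, assuming the gene sequence is given in the coding orientation. Translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs in the start codons it permits; this sketch ignores alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCGAAAGA AAACTATCTT GCCTTTCTGC"))  # MRKKTILPFC
print(translate("CTTGTGGAGT AA"))  # LVE
```

The two fragments match the first ten and last three residues of the protein sequence listed above.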