Gene Noc_2444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2444 
Symbol 
ID3704601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2788950 
End bp2790107 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content50% 
IMG OID637738924 
Productputative oxygen-independent coproporphyrinogen III oxidase 
Protein accessionYP_344428 
Protein GI77165903 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCAAT TCCATGCTTT ACCTCCGCTT GCGCTGTATA TCCATTTTCC CTGGTGTGTC 
CGCAAGTGTC CCTACTGCGA TTTTAACTCC CATTCCTTGC CAGAAAAATT TCCGGAAACC
GCCTATATCG ACGCTCTAGT TCGGGATCTG GAACAGGATT TGCCTAGAGT CTGGGGCCGT
CGCATCCACA GTATCTTTAT GGGCGGCGGA ACACCTAGCC TTTTTTCTGC GGAAAGCCTC
GATCGGCTAC TCTCCGCTAT ACGCGCCCGC CTCCCTTGGT CCCCCCAAAT AGAGATTACT
CTGGAAGCCA ACCCTGGAAC TGTAGAACAG GGGCGTTTCA CCGAATTCCA TCATATCGGC
ATCAACCGCC TCTCCCTGGG TGTCCAAAGT TTTAACGATG AAGCGCTTAA GCACCTGGGA
CGAATCCACT CTTCCCAGGA ATCCTGTCAA GCGATAGCAG CGGCCCATAC CGCCGGGTTT
AATAATATTA ACTTGGATCT TATGTTCGGC CTTCCTGGGC AAACTGAAAC CCAGGCCCTT
GCTGATGTGG CCATGGCCAT TAGTCTCGCC CCCCACCATA TTTCCTATTA TCAGCTTACC
CTTGAGCCTA ATACCTACTT TTATCGCTAT CCTCCATCCC TGCCCCATGA AGAAGCAATT
AATAGCTGGC AAACAGATTG CCAAGTCCGC TTAGCCCAGG CGGGATATCA CCGCTATGAG
GTGTCTGCTT TTGCCAAAAG CGATCGGCAA TGCCAGCATA ACTTGAATTA TTGGCGATTT
GGGGATTACC TTGGGATCGG TGCTGGTGCC CATGGAAAAA TTTCCGATGC CATGGAAAAC
CACATTACTC GAATATGGAA AATAAAACAT CCTACAAGCT ACCTAGCCAA AGCAGGCACC
CCGGAAGGCA TAGGAGGAGA AACGAATCCC TCTCCCCAAG AGGCGGCCTT CGAATTTATG
CTTAATGCCC TGCGCTTAAA TGGAGGGATT CCCAGTCGTC TATTTCAAGA ACGGGTGGGC
CTTCCTATTA GTGTAGTGGA AAAAACATTA CATCAAGCAG AGGCGTTGAA GCTGATAGAG
TGGGATTTTC GGCATATCCG CCCGACGGAA AAAGGCTTCT CTTTGCTCAA CGAGCTATTA
CAACTGTTTC TACCATAA
 
Protein sequence
MFQFHALPPL ALYIHFPWCV RKCPYCDFNS HSLPEKFPET AYIDALVRDL EQDLPRVWGR 
RIHSIFMGGG TPSLFSAESL DRLLSAIRAR LPWSPQIEIT LEANPGTVEQ GRFTEFHHIG
INRLSLGVQS FNDEALKHLG RIHSSQESCQ AIAAAHTAGF NNINLDLMFG LPGQTETQAL
ADVAMAISLA PHHISYYQLT LEPNTYFYRY PPSLPHEEAI NSWQTDCQVR LAQAGYHRYE
VSAFAKSDRQ CQHNLNYWRF GDYLGIGAGA HGKISDAMEN HITRIWKIKH PTSYLAKAGT
PEGIGGETNP SPQEAAFEFM LNALRLNGGI PSRLFQERVG LPISVVEKTL HQAEALKLIE
WDFRHIRPTE KGFSLLNELL QLFLP