Gene Noc_1545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1545 
Symbol 
ID3705803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1715002 
End bp1716108 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content54% 
IMG OID637738030 
Productferrochelatase 
Protein accessionYP_343559 
Protein GI77165034 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATTTTA AAGGTTATTC GGATTACCGC CATGATACGG TTGCCCGTAT CGGCGTTTTG 
GTGGCCAGTC TAGGAACACC AGAAGCGCCT ACGGCATCTG CGGTGCGGCG CTTTTTAGCT
AGCTTCTTAT CTGATCCGCG TGTGGTGGAG TTGCCCCGTC CTTTATGGTG GCTTATTCTC
CATGGCATTA TTTTGCGTAT TCGGCCATCC CCAGTGGCTC GTCTGTATCA AAGTATCTGG
CGGGAAGATG GATCTCCACT GCTGAGTTTT GCCCGGCGTG TGGGACAAAG CCTGCAAGCT
GAATTGGATA GCCGGGGAAG GTCTATTGAA ATAAGGTTAG GAATGCGTCA CGGCTCGCCT
TCCATCGAAA CGGCGCTGGA GGAACTGCGC CAGTCGGGAG CCCAGCGGCT ATTGGTTTTT
CCTCTTTATC CCCAGTACTC GGGAAGTACT ACCGGCTCTA CCTTTGATGC GGTAGCCCAA
GTGCTTTCCA CTTGGCGCTG GGTGCCGGAA TTGCGGATGA TTGCTCAATA CCATGATCAT
TCCGGCTACC TTGAGGCATT GGCGGAGACG ATTCGGCGTA GCTGGAAAGA GGCGGGGCGG
GGAGAGCGCT TGCTTATTTC TTTCCATGGC CTGCCGAAAC GGTACTTACT AGCCGGCGAT
CCCTATCATT GCCAGTGCCA AAAAACCGCT CGCCTTTTGG CAGAGCGATT AGGATTAAAA
GAGGGCGAAT GGCAAATAGC TTTTCAGTCC CGTTTTGGCC GTGAAGAATG GCTTAAGCCC
TATGCTGATC ACCTCTTGCA AGCCTGGGCC GAAGCCGGAA TAAAACGGGT GGATGTCGTT
TGCCCTGGGT TTGCTGTCGA TTGTTTAGAA ACTCTGGAAG AGATGGCCCA GCGTAACAGG
GAACTGTTTT TACACGCAGG AGGAGAAGAG TATCGCTATA TTCCCGCGCT TAACGATGAG
TCTGCCCATA TCCGTGCTTT GACCGATCTG GTTGAGCAAC ATATCCAAGG GTGGTCCGAA
GCCGATTTAG GTGGGGGGCG GGAGGCGACG GGTCAAGCCG CCGAGAGAAG CCGTCAACGG
GCTTTGGCGC TTGGCGCTAA GCAATAA
 
Protein sequence
MNFKGYSDYR HDTVARIGVL VASLGTPEAP TASAVRRFLA SFLSDPRVVE LPRPLWWLIL 
HGIILRIRPS PVARLYQSIW REDGSPLLSF ARRVGQSLQA ELDSRGRSIE IRLGMRHGSP
SIETALEELR QSGAQRLLVF PLYPQYSGST TGSTFDAVAQ VLSTWRWVPE LRMIAQYHDH
SGYLEALAET IRRSWKEAGR GERLLISFHG LPKRYLLAGD PYHCQCQKTA RLLAERLGLK
EGEWQIAFQS RFGREEWLKP YADHLLQAWA EAGIKRVDVV CPGFAVDCLE TLEEMAQRNR
ELFLHAGGEE YRYIPALNDE SAHIRALTDL VEQHIQGWSE ADLGGGREAT GQAAERSRQR
ALALGAKQ