Gene Nwi_0191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0191 
Symbol 
ID3676647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp221163 
End bp222323 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content66% 
IMG OID637711729 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_316811 
Protein GI75674390 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.605644 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCCCCG CTGATCCGGC CGCTGCGTTC GGCGTTTACA TCCACTGGCC GTTCTGCCTG 
TCGAAGTGTC CGTATTGCGA TTTCAACAGT CACGTGCGCC ACGCGCCGAT CGATGAAGAA
CGCTTTACAC GGGCTTTCGC GCGCGAGATC GAGACCACGG CGGCCCGCTC GCGCGGACGC
ACGGTGTCCT CGATTTTCCT CGGCGGCGGC ACGCCGTCCC TGATGCGGCC GCAGACCGTG
GGCGCCATTC TCGACGCAAT CGGAAAGCAT TGGAGCGTTG CCGATGACGT CGAGGTCACG
CTGGAAGCCA ATCCGACCAG CGTGGAGGCC ACACGATTCC GCGGCTACCG CGCCGCGGGC
GTCAATCGGG TCTCGCTCGG GGTGCAGGCG CTCGACGATT CCTCGCTGAA GGCGCTCGGT
CGGCTGCACA CGGCGCGCGA GGCGCAGGAT GCGGTTGCGA TCGCGCGCTC GGTCTTCGAT
CGCTATTCCT TCGATCTGAT CTATGCGCGT CCGGACCAGA CACCGGCGAT GTGGGCCGGG
GAGTTGCAGC GCGCGATCTC GGAGGCGGCG GAGCATCTGT CGCTCTATCA ATTGACGATC
GAAGCGGGGA CGCCGTTTTT CGACCTGCAT GCGGCGGGGA AGCTCAAGAC CCCCGATGAG
GCGATGGCGC GAGACTTGTA CGACGTCACG CAGGACGTCT GCGCGCGTCA TGGCCTGCCC
GCTTACGAGA TCTCGAACCA TGCGCGGCCG GGCGCGGAAT GCCGGCACAA TCTGGTCTAC
TGGCGCGGTC AGGAATATGC GGGCATCGGC CCCGGAGCGC ACGGTCGCCT CGACATCGAC
GGCGTCAGGC ACGCCATCGC CACCGAGAAG CGGCCCGAAG CCTGGCTCAT GCGCGTCGAG
GCGAACGGCA ACGGAATTGT CGCCGATGAC CCGTTGAACA GCGAAGAGCG CGCCGACGAA
TTCCTGCTCA TGGGACTGCG CTTGCGCGAG GGCATCGATC CGCGGCGCTA CGCGGCGCTG
TCGGGCCGCT CTCTTGATCC CCGGCGCATC GCGATCCTCC GTGACGAAGG CGCGATCATG
ATCAGCACCG ACGGCCGTTT GCGCGTCACG CAGGATGGCT TCCCCGTCCT CGATGCCGTG
GTGGCCGATC TCGCCGCATA G
 
Protein sequence
MSPADPAAAF GVYIHWPFCL SKCPYCDFNS HVRHAPIDEE RFTRAFAREI ETTAARSRGR 
TVSSIFLGGG TPSLMRPQTV GAILDAIGKH WSVADDVEVT LEANPTSVEA TRFRGYRAAG
VNRVSLGVQA LDDSSLKALG RLHTAREAQD AVAIARSVFD RYSFDLIYAR PDQTPAMWAG
ELQRAISEAA EHLSLYQLTI EAGTPFFDLH AAGKLKTPDE AMARDLYDVT QDVCARHGLP
AYEISNHARP GAECRHNLVY WRGQEYAGIG PGAHGRLDID GVRHAIATEK RPEAWLMRVE
ANGNGIVADD PLNSEERADE FLLMGLRLRE GIDPRRYAAL SGRSLDPRRI AILRDEGAIM
ISTDGRLRVT QDGFPVLDAV VADLAA