Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_19920 |
Symbol | hemN |
ID | 7760922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1978721 |
End bp | 1980097 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643804890 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_002799174 |
Protein GI | 226944101 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00538] oxygen-independent coproporphyrinogen III oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCTC CGATCAAATG GGATACCAAC CTGATCCGTC GCTACGATCA GGCGGGTCCG CGCTACACCT CATACCCCAC GGCAGTGCAG TTCCAGCCTG GCGTGGGCTC CTTCAAGCTG CTCAACGCAT TGCGCGAAAG CCGCCAGGCC AAGCGGCCGC TGTCGATCTA CATACACATT CCGTTCTGCG CCCACGTCTG CTACTACTGT GCCTGCAACA AGGTCATCAC CAAGGATCGC GGCCGCGCCC AGCCTTACCT CGAACGGCTG GAGAAGGAAA TCGACATCGT CAGCCGCCAC CTTTCGCCGC AACAACAGGT CGAGCAACTG CACCTGGGCG GCGGCACACC GACCTTCCTC AGCCACGGCG AACTGCAGCG CTTGATGGAG TTTCTGCGTC AACACTTCGG CTTCGCCGAA AAAGTCGACT GCAGCATCGA GATCGACCCT CGGGAAGCGG ACTGGTCGAC CATGGGGCTG CTCCGCGAGC TGGGCTTCAA CCGCATCAGC CTCGGCGTGC AGGATCTCGA TCCCGGCGTC CAGCGCGCCG TGAACCGCCT GCAGAGCCTG GAGGAAACCA GCGCCATCAT CGAAGCGGCG CGGACCCTGG AGTTCCGCTC GATCAATGTC GATCTCATCT ATGGCCTGCC CAAGCAGACA CCGGAAACCT TCACCCGCAC CGTCGAAGAA ATCATCGCCC TGCAACCCGA TCGACTGTCG GTGTTCAACT ATGCCCACCT GCCGGAACGC TTCAAACCGC AGCGGCGAAT CAGCGCCGCC GACCTGCCGG CACCAGCCGA TAAACTGACC ATGCTGCACA ACAGCATCGA GCAACTCTCG GCCGCCGGCT ATCAGTACAT CGGCATGGAT CATTTCGCCC TGCCGGACGA CGAACTGTCC ATCGCCCAGG AAGACGGCAG GCTGCAACGC AACTTCCAGG GCTACACCAC GCACGGCCAT TGCGACCTGA TCGGCCTGGG TGTTTCGTCC ATCAGCCAGA TCGGCGACCT CTATAGCCAG AACGAGAGCG ACATCGACGC CTACCAGAGC CGCCTCGACA ACGATCAGTT GCCGACCGCC CGCGGCCTGC AATGCAACAC CGACGACCGG ATTCGCCGCG CGGCCATCCA GAAACTCATA TGCGATTTCC AATTGGACTT CGCCAGCCTG GAACAACGGT TCGGTATCGT CTTCCGCGAC TACTTCGCCG ATATCTGGCC TCAGCTACAG AGCATGACCC GCGACGGGCT GATCGCCCTG TCCCCTCAAG GCATCGAAGT GCTACCAGCC GGACGCCTGC TCGTTCGTTC GATCTGCATG CTTTTCGACT ATTATCTGGT CGAACACAAT CGCCAGCGCT TCTCCCGTAT CATCTGA
|
Protein sequence | MPAPIKWDTN LIRRYDQAGP RYTSYPTAVQ FQPGVGSFKL LNALRESRQA KRPLSIYIHI PFCAHVCYYC ACNKVITKDR GRAQPYLERL EKEIDIVSRH LSPQQQVEQL HLGGGTPTFL SHGELQRLME FLRQHFGFAE KVDCSIEIDP READWSTMGL LRELGFNRIS LGVQDLDPGV QRAVNRLQSL EETSAIIEAA RTLEFRSINV DLIYGLPKQT PETFTRTVEE IIALQPDRLS VFNYAHLPER FKPQRRISAA DLPAPADKLT MLHNSIEQLS AAGYQYIGMD HFALPDDELS IAQEDGRLQR NFQGYTTHGH CDLIGLGVSS ISQIGDLYSQ NESDIDAYQS RLDNDQLPTA RGLQCNTDDR IRRAAIQKLI CDFQLDFASL EQRFGIVFRD YFADIWPQLQ SMTRDGLIAL SPQGIEVLPA GRLLVRSICM LFDYYLVEHN RQRFSRII
|
| |