Gene Avin_19920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_19920 
SymbolhemN 
ID7760922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1978721 
End bp1980097 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content61% 
IMG OID643804890 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_002799174 
Protein GI226944101 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00538] oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCTC CGATCAAATG GGATACCAAC CTGATCCGTC GCTACGATCA GGCGGGTCCG 
CGCTACACCT CATACCCCAC GGCAGTGCAG TTCCAGCCTG GCGTGGGCTC CTTCAAGCTG
CTCAACGCAT TGCGCGAAAG CCGCCAGGCC AAGCGGCCGC TGTCGATCTA CATACACATT
CCGTTCTGCG CCCACGTCTG CTACTACTGT GCCTGCAACA AGGTCATCAC CAAGGATCGC
GGCCGCGCCC AGCCTTACCT CGAACGGCTG GAGAAGGAAA TCGACATCGT CAGCCGCCAC
CTTTCGCCGC AACAACAGGT CGAGCAACTG CACCTGGGCG GCGGCACACC GACCTTCCTC
AGCCACGGCG AACTGCAGCG CTTGATGGAG TTTCTGCGTC AACACTTCGG CTTCGCCGAA
AAAGTCGACT GCAGCATCGA GATCGACCCT CGGGAAGCGG ACTGGTCGAC CATGGGGCTG
CTCCGCGAGC TGGGCTTCAA CCGCATCAGC CTCGGCGTGC AGGATCTCGA TCCCGGCGTC
CAGCGCGCCG TGAACCGCCT GCAGAGCCTG GAGGAAACCA GCGCCATCAT CGAAGCGGCG
CGGACCCTGG AGTTCCGCTC GATCAATGTC GATCTCATCT ATGGCCTGCC CAAGCAGACA
CCGGAAACCT TCACCCGCAC CGTCGAAGAA ATCATCGCCC TGCAACCCGA TCGACTGTCG
GTGTTCAACT ATGCCCACCT GCCGGAACGC TTCAAACCGC AGCGGCGAAT CAGCGCCGCC
GACCTGCCGG CACCAGCCGA TAAACTGACC ATGCTGCACA ACAGCATCGA GCAACTCTCG
GCCGCCGGCT ATCAGTACAT CGGCATGGAT CATTTCGCCC TGCCGGACGA CGAACTGTCC
ATCGCCCAGG AAGACGGCAG GCTGCAACGC AACTTCCAGG GCTACACCAC GCACGGCCAT
TGCGACCTGA TCGGCCTGGG TGTTTCGTCC ATCAGCCAGA TCGGCGACCT CTATAGCCAG
AACGAGAGCG ACATCGACGC CTACCAGAGC CGCCTCGACA ACGATCAGTT GCCGACCGCC
CGCGGCCTGC AATGCAACAC CGACGACCGG ATTCGCCGCG CGGCCATCCA GAAACTCATA
TGCGATTTCC AATTGGACTT CGCCAGCCTG GAACAACGGT TCGGTATCGT CTTCCGCGAC
TACTTCGCCG ATATCTGGCC TCAGCTACAG AGCATGACCC GCGACGGGCT GATCGCCCTG
TCCCCTCAAG GCATCGAAGT GCTACCAGCC GGACGCCTGC TCGTTCGTTC GATCTGCATG
CTTTTCGACT ATTATCTGGT CGAACACAAT CGCCAGCGCT TCTCCCGTAT CATCTGA
 
Protein sequence
MPAPIKWDTN LIRRYDQAGP RYTSYPTAVQ FQPGVGSFKL LNALRESRQA KRPLSIYIHI 
PFCAHVCYYC ACNKVITKDR GRAQPYLERL EKEIDIVSRH LSPQQQVEQL HLGGGTPTFL
SHGELQRLME FLRQHFGFAE KVDCSIEIDP READWSTMGL LRELGFNRIS LGVQDLDPGV
QRAVNRLQSL EETSAIIEAA RTLEFRSINV DLIYGLPKQT PETFTRTVEE IIALQPDRLS
VFNYAHLPER FKPQRRISAA DLPAPADKLT MLHNSIEQLS AAGYQYIGMD HFALPDDELS
IAQEDGRLQR NFQGYTTHGH CDLIGLGVSS ISQIGDLYSQ NESDIDAYQS RLDNDQLPTA
RGLQCNTDDR IRRAAIQKLI CDFQLDFASL EQRFGIVFRD YFADIWPQLQ SMTRDGLIAL
SPQGIEVLPA GRLLVRSICM LFDYYLVEHN RQRFSRII