Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_45190 |
Symbol | hemE |
ID | 7763388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4580710 |
End bp | 4581777 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643807368 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_002801609 |
Protein GI | 226946536 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTCC TGAAGAACGA CCGCTTTCTC CGCGCCCTGC TCAAGCAACC CGTCGATGTC ACTCCGGTGT GGATGATGCG CCAGGCCGGC CGCTACCTGC CGGAATACCG GGCGACCCGG GCGAAGGCGG GCGATTTCAT GAGTCTGTGC ATGAATCCCG AGCTGGCCTG CGAGGTGACC CTGCAGCCGC TGGACCGCTA TCCGCAACTG GACGCGGCGA TCCTCTTCTC CGACATCCTC ACCGTTCCCG ACGCCATGGG GCTGGGTCTG TATTTCGAGA CCGGGGAGGG GCCGCGCTTT CGCAAAGTGG TTTCCAGCCC GGCGGACATC GAGGCCCTGC CGGTGCCCGA TCCCGAGCGG GACCTGGGCT ACGTGATGGC GGCGGTACGC ACCATCCGCC GCGAGCTGAA CGGCCGCGTG CCGTTGATCG GCTTCTCCGG CAGTCCCTGG ACCCTGGCCA CCTACATGGT CGAGGGCGGC TCCAGCAAGG ACTTCCGCAA GTCCAAGGCG ATGCTCTACG ACAATCCGCA GGCCATGCAC GCCCTGCTCG ACAAGCTGGC CCGGGCGGTC ACCGCCTACC TCAACGGGCA GATCCTGGCC GGCGCCCAGG CGGTGCAGAT CTTCGACTCC TGGGGCGGCA GCCTGTCGTC GGCGGCCTAC CAGGAGTTTT CCCTGGCCTA CATGAAGAGG ATCGTCGACG GCCTGATCCG CGAGCACGAA GGCCGGCGCG TGCCGGTGAT CCTCTTCACC AAGGGCGGCG GCCTGTGGCT GGAGGCCATG GCCGGGAGCG GCGCCGAGGC CCTGGGCCTG GACTGGACCT GCGACATCGG CGATGCCCGT GCCCGCGTCG GCGGCAAGGT GGCTCTGCAG GGCAACATGG ACCCGAGCGT CCTCTACGCC AACCCGGCGG CGATCCGCGC CGAGGTGGCG CGCATCCTCG CCCGTTACGG CGCGGGCTCC GGGCATGTCT TCAACCTCGG CCACGGCATC ACCCCCGAGG TCGATCCGGC CCATGCCGGC GCCTTCTTCG AGGCGGTGCA CGAACTGTCG GCGCAATATC ACCGCTGA
|
Protein sequence | MTVLKNDRFL RALLKQPVDV TPVWMMRQAG RYLPEYRATR AKAGDFMSLC MNPELACEVT LQPLDRYPQL DAAILFSDIL TVPDAMGLGL YFETGEGPRF RKVVSSPADI EALPVPDPER DLGYVMAAVR TIRRELNGRV PLIGFSGSPW TLATYMVEGG SSKDFRKSKA MLYDNPQAMH ALLDKLARAV TAYLNGQILA GAQAVQIFDS WGGSLSSAAY QEFSLAYMKR IVDGLIREHE GRRVPVILFT KGGGLWLEAM AGSGAEALGL DWTCDIGDAR ARVGGKVALQ GNMDPSVLYA NPAAIRAEVA RILARYGAGS GHVFNLGHGI TPEVDPAHAG AFFEAVHELS AQYHR
|
| |