Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_31250 |
Symbol | |
ID | 7762025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3230214 |
End bp | 3231476 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643806000 |
Product | protocatechuate 4,5-dioxygenase |
Protein accession | YP_002800264 |
Protein GI | 226945191 |
COG category | [S] Function unknown |
COG ID | [COG3384] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02792] protocatechuate 4,5-dioxygenase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACGAA TCATTGGCGG CCTCGCCGTC TCCCACACAC CGACCATCGG CTTCGCCGTC GACAACGACA AGCAACACGA CGAAGCCTGG GCGCCGATCT TCAAGAGTTT CGAGCCGGTC TCGGCATGGC TGCGGGAAAA GCGGCCGGAC GTCCTGTTCT ACATCTTCAA CGACCATGTG ACCTCGTTCT TCTTCAATCA CTACGCGGCC TTCAACCTGG GCGTGGACGA GCGCTACGAA CCCGCCGACG AAGGGGGCGG CCCGCGCGCC CTGCCGGCCG TCGAAGGCCA TGCCGAGCTG GCCCGGCACA TCGGCGCAAG CCTGATGGCC GACGAGTTCG ACATGGCGTT CTTCCGCGAC AAGCCCCTGG ACCACGGGCT GTTCTCGCCG ATGTCGGCGA TCCTGCCGCC CGATGCCCGC TCCGGATGGC CGGTGAAGAT CGTTCCGCTG CAGGTCGGCG TGCTGCAGTT CCCGATTCCC AGCGCCGCCC GCTGCTACAA GCTGGGCCAG GCACTGCGCC GGGCCATCGA GAGCTACCCC GAGGATCTGA AGGTGGCGAT CGTGTCGACC GGCGGCCTCT CGCATCAGGT CCACGGCGAG CGGTGCGGCT TCAACGACCC GCAATGGGAC GCCCAGTTCG TCGATCTGCT GGTCAACGAT CCGGTGCGCC TGACCGAGCT GACCGTCGCC GAATACGCAG CCCTCGGCGG CGTGGAAGGT GCCGAGGTGA TCATGTGGCT GATCATGCGC GGCGCCCTGT CCGCCACGGT GAAGAAGGTG CACCAGGATT ACTACCTGCC GTCGATGACC GGGATCGCCA CCCTGATCCT GGAGAATCGG GACCGCGAAG TGCCGGTGGA CCTGCATGAG CGCCACCGTC GGCACATGGA CCATCAACTG GCCGGAGCGG ACCGGCTCGA AGGCACCTAC CCGTTCGACC TGGCGCGCAG CGCCAAGGGC TACCGGCTGA ACAAGTTCCT GCACGGGCTG ATCTCGCCCG CCTTCCGCGA GCGCTTCAAG GAAGAGCCGG AAACCCTGTT CGAAGAACAC CGGCTCAGCG AGCAGGAGCG CGACATGCTC CGCCGCCTCG ACTGGCGCGC CCTGATCCAG TACGGGGCGA GCTTCTTCGT GCTGGAAAAG CTCGGCGCGG TCGTCGGCGT CTCCAACCTG CACATCTATT CGGCCATGCG CGGCCAGTCG CTCGAGGAGT TCCAGAAGAC CCGCAACCGG CAGGTCCTCT ACTCGGTGGC CGGCAAACGC TGA
|
Protein sequence | MARIIGGLAV SHTPTIGFAV DNDKQHDEAW APIFKSFEPV SAWLREKRPD VLFYIFNDHV TSFFFNHYAA FNLGVDERYE PADEGGGPRA LPAVEGHAEL ARHIGASLMA DEFDMAFFRD KPLDHGLFSP MSAILPPDAR SGWPVKIVPL QVGVLQFPIP SAARCYKLGQ ALRRAIESYP EDLKVAIVST GGLSHQVHGE RCGFNDPQWD AQFVDLLVND PVRLTELTVA EYAALGGVEG AEVIMWLIMR GALSATVKKV HQDYYLPSMT GIATLILENR DREVPVDLHE RHRRHMDHQL AGADRLEGTY PFDLARSAKG YRLNKFLHGL ISPAFRERFK EEPETLFEEH RLSEQERDML RRLDWRALIQ YGASFFVLEK LGAVVGVSNL HIYSAMRGQS LEEFQKTRNR QVLYSVAGKR
|
| |