Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_22390 |
Symbol | |
ID | 7761157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2238862 |
End bp | 2239818 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643805124 |
Product | Taurine dioxygenase protein |
Protein accession | YP_002799405 |
Protein GI | 226944332 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTCTT CCGCCCGCCT GCATCAGGCC GCGCTGCCCG CCGCCGAGGC GATCGTCGTC ACCCCGCTGT CCCTGTACAT CGGCGCCCAG GTCGACGGGG TCGACCTGTC GCGCCCGTTG CCCTCCGCCC AACGCGAAGC CATCCGCGCC GCCCTGCTGC GCTGGAAGGT GCTGTTCTTC CACGACCAGC ACCTCGACCA CGCCCAGCAG GTGGCGTTCG GCCGGCAATT CGGCGAACCG ACCGTCGGCC ACCCGGTGTT CGGCCATGTC GAGGGTCACC CGGAGATCTA TTCGGTCGGC CGCGACCGCT TCAAGGCGCG CTTCACCGAC GAGCGCCTGG TGCGCCCCTG GAGCGGCTGG CACACCGACG TGAGCGCGGC GCTCAATCCG CCGGCGGCGG CGATCCTGCG CGGCGTGGAC ATCCCGCCCT ATGGCGGCGA CACCCAGTGG ACCGACCTGG TGGCCGCCTA CAATGGCCTG TCGCCGACCC TGCGGGCCTT CGTCGACGGC CTGCGCGGCG AGCACCGCTT CACCCCGCCG GAAGGCGCCG AGGCACGGCC GGGCTTCAGC GAGCCGCTGG CGGTCAGGCC GCTGGTCAGC GAGCATCCGC TGGTGCGCGT GCACCCGGAA ACCGGCGAGA AGGCGCTGTT CGTCAGCCCG ACCTTCCTCA AGCGCATCGT CGGCCTCAGC CCGCGCGAGA GCGAACAGTT GCTGGAACTG CTCTTCGAAC ACGCGATCCG CCCCGAATAC ACGGTGCGCT TCAAGTGGCG ACCCGGCTCG CTGGCCTTCT GGGACAACCG GGTCACCGCC CACCAGCCGC CATCCGACAT CCACGCCACC GACCTGCCCC GCCAGCTCTA CCGCATCACC CTGGTCGGCG ACATTCCCGT CGGCCCGGAC GGCCGTCCCT CCCGCACCAT CGCCGGCGAG CCGGTGCTGG CTCATCCGGC CGCCTGA
|
Protein sequence | MPSSARLHQA ALPAAEAIVV TPLSLYIGAQ VDGVDLSRPL PSAQREAIRA ALLRWKVLFF HDQHLDHAQQ VAFGRQFGEP TVGHPVFGHV EGHPEIYSVG RDRFKARFTD ERLVRPWSGW HTDVSAALNP PAAAILRGVD IPPYGGDTQW TDLVAAYNGL SPTLRAFVDG LRGEHRFTPP EGAEARPGFS EPLAVRPLVS EHPLVRVHPE TGEKALFVSP TFLKRIVGLS PRESEQLLEL LFEHAIRPEY TVRFKWRPGS LAFWDNRVTA HQPPSDIHAT DLPRQLYRIT LVGDIPVGPD GRPSRTIAGE PVLAHPAA
|
| |