Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_22330 |
Symbol | |
ID | 7761151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2234132 |
End bp | 2235046 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643805118 |
Product | taurine dioxygenase protein |
Protein accession | YP_002799399 |
Protein GI | 226944326 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCTGG ATATCCGTCC CGTCACCGGC CGCATCGGCG CCATCGTCAG CGGGGTGCGC CTGACCGATC TGGACGAGGC GCAATTCGCC GAGCTGCAGC AGGCCGTGCT CAAGTACAAG GCACTGTTCC TGCGCGACCA GCATCTGACC GACGCCGAGC AGGAAGCCTT CAGCCGCCGC TTCGGCGATC TGGTGGTGCA TCCGACCACC CCCACGGAAC AGAACACCGC CGGTATCCTC GAAGTGGACT CCGAGCGCTC GCGGGCCAAC TCCTGGCACA CCGACATCAG CTTCGTCGTC GACTACCCGA AGATCACCAT CCTGCGCGGC GAGGTGATCC CCGAAGCGGG CGGCGACACC GTGTTCGCCA ACACCGTCAC GGCCTACCAG GAACTGCCCG AGCCGCTGCA GCGGCTGGCC GACAGCCTGT GGGTGCGCCA CACCAACGAC TACGACTACG CCGCGCCCAA GCAGATCGAA GTGGTGGCCA ACAAGCGCTT CCGCGAGCAG TTCACCTCCA CCATCTACGA GAGCGAGCAC CCGCTGGTGC GCGTGCACCC GGAAACCGGC GAGAAGGCAC TGCTGCTCGG CCACTTCGCG GAGAGGATCG TCGGCCTCAA CTCGCGCGAC TCGCGGGCGC TGCTGGACCT GTTCCAGTCG CACATCGTCA AGCTGGAGAA CATCGTGCGC TGGCGCTGGA GCGAAGGCGA CGTGGCGCTG TGGGACAACC GCGCCACCCA GCACATCGCC ATCGACGACT ACGGCAACGC CCGGCGCATC GTGCGCCGCA CCACGGTGCT CGGCGAGATC CCGGTGTCGG TCGGCGGCGA GAGCAGCCGG GCGATCAAGC CCAGCCCGGA AAGCCGCATC CCGCGCAGCG AGGAGGATAA GGTCGCGCTG AAGAAGGCCG GCTGA
|
Protein sequence | MSLDIRPVTG RIGAIVSGVR LTDLDEAQFA ELQQAVLKYK ALFLRDQHLT DAEQEAFSRR FGDLVVHPTT PTEQNTAGIL EVDSERSRAN SWHTDISFVV DYPKITILRG EVIPEAGGDT VFANTVTAYQ ELPEPLQRLA DSLWVRHTND YDYAAPKQIE VVANKRFREQ FTSTIYESEH PLVRVHPETG EKALLLGHFA ERIVGLNSRD SRALLDLFQS HIVKLENIVR WRWSEGDVAL WDNRATQHIA IDDYGNARRI VRRTTVLGEI PVSVGGESSR AIKPSPESRI PRSEEDKVAL KKAG
|
| |