Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_21330 |
Symbol | |
ID | 7761058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2134328 |
End bp | 2135344 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643805028 |
Product | Taurine catabolism dioxygenase, TauD/TfdA family |
Protein accession | YP_002799309 |
Protein GI | 226944236 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.545392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGAC TGCAAACGAT TGCCGATCTG TCCTGGCCGC GCCGCGCCGA CACAGCGGAG CCCAACAGCC TCGCACGGCT GTCGGTGACG CCCACCGGCA AGGCCCTCGG CGCGGTGGTG ACCGGCGTGG ACCTGGCGCA GCCGCTGTCC GCCGACATCG CCGACGCCTT GCGACGCGCC TGGCGGGAAC ACCTGGTCCT GCTGTTTCCG GGCCAGTTCC TGGAGCCGGA AACGCTGCTG CGCGCCGCCT CCACCTTCGG CCAGCCGCAG GAAGGCGCTA ACCGCCTCTA TATCCGGGCG GCCGGCATCG CCCAGGAAGA GCGGTTCCCG GCCCTGCTGC CGATCACCAA CCTGGGCCCG GACGGCACGC CGGTGCGCGA GAACGACGGC CTCGGCAGCC TGGAGGTGGT CTGGCACTCG GACAACTCCT ACATCGAAGC GCCGCCGATC GGCTGCCTGC TGTATGCGCT GGAGGCACCG GCGGACAGCG GTTTCACTTC GTTCGCCAAC CAGTTCCTGG CCTACGAGCG CCTGTCCGAG ACGCTCAAAC GCGACATCGA GGGGCGCTGG GCCAAGCACG ACGCCAGCCG CAACAGCGCC GGCATGCTGC GCCCCGGCCT GCGCACGCCG AGCCGCCCGG AAGAGGTGCC GGGGCCCTTC CATCCGCTGG TCATCCGCCA GCCCGGGAGC GCTCGGCGCG CCCTGTTCCT GGGGCGGCGG CGGATCTTCC CCTCGCAATA CATCGAAGGG CTGCCCGGCG TCGAGAGCGA GGCGTTGCTG GACGCCCTGT GGGCGGCCGC GACCCATCCG GATATCACCT GGACGCACCG CTGGACACCC GGCGACGTAC TGCTCTGGGA CAACTGCCAT ACCTTGCACC ACCGCACCCC GGTCGACGCG ACGCGCCGGC GGGTCATGGT GCGTACCCAG TTCCAGGGAC AGACGCCGCG GGCGGACGGC CAGCGGCGCC ATTTCACCGC AAGCGAACCG AATCACCCGC AGGAGCTGTC GGCATGA
|
Protein sequence | MKGLQTIADL SWPRRADTAE PNSLARLSVT PTGKALGAVV TGVDLAQPLS ADIADALRRA WREHLVLLFP GQFLEPETLL RAASTFGQPQ EGANRLYIRA AGIAQEERFP ALLPITNLGP DGTPVRENDG LGSLEVVWHS DNSYIEAPPI GCLLYALEAP ADSGFTSFAN QFLAYERLSE TLKRDIEGRW AKHDASRNSA GMLRPGLRTP SRPEEVPGPF HPLVIRQPGS ARRALFLGRR RIFPSQYIEG LPGVESEALL DALWAAATHP DITWTHRWTP GDVLLWDNCH TLHHRTPVDA TRRRVMVRTQ FQGQTPRADG QRRHFTASEP NHPQELSA
|
| |