Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_29550 |
Symbol | arsB |
ID | 7761856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3044882 |
End bp | 3046102 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643805828 |
Product | Type III polyketide synthase |
Protein accession | YP_002800096 |
Protein GI | 226945023 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3424] Predicted naringenin-chalcone synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.96869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGTC CCCACAACGC AGTTCTCACC GGCTTCACAC CCGTGCAACT GGCCAAACCC GTTCCCCAGG CGCTGACCCT GGAGCTGTCC GCCTATGCCT TCGCCCGCGC CTACTGCATC AAGAACGGCG TCGGGACGGA CGACGAGGCG GGCTTCGCCA AGGTCTACCA GTCGGTCAAG GAGAAGTTCG ACAAGTACGC CCTGTCGTCC GCGCAGATCA AGCGGCGTCA ACTGATCTTC TTTCCGAAGG TCTCCGACAT CCATTTCGCC AACGGTCACG TCGATATCGC GGCGCCCGAG CATGCCTACC TGAAGCTGTA CGACATGGCC ACCGACCCGC GCGGCTCCGA CCTCAAGGTC CGCCACGAAA GCTACGCCAA GGTCGTCGAC CAGGGGCTGG AGCGGATGTT TCAGGACAGT GCCGAAGCCC CCGACGATCT GATCCACGTG ACCTGCTCGG GCTACCTGTC GCCGAGTCCG GTCGAGCGCA TGGCGGCCGA TCGCGGCTGG TTCGAGACCA CGGTGACCCA CAGCTATCAC ATGGGCTGCT ACGGCGCCTT CCCGGCGATC AAGATGGCCC ACGGCATGCT TTCCTCGTCG CGCTTCGGCG TCACCCCGGT CAAGCACCGG GTGGATATCG TGCATACCGA GCTGCTCTCG GCGCACAACA ACATCGTCGA TGCGCGGGCG GAAAACATCA TCACCATGAC CCTGTTCGCC GACGGCCTGA TCAAGTACTC GGTGCTCTCC GAGGAGGAAC TGCACCGCCA GGGCGGACAC GGCCTCCGGG TCCTGGCGAT GAACGAGCAC CTGCTGCCCG ACTCGGCCGA CGAGATGACC TGGGTGCCGG GCTCGCACCA GTTCCTGATG ACCCTCACGC CCATGGTGCC GGTGGTCATC AAGCGCCATG TCCGCGACTT CGTGGTGAAG CTGCTCGAAC GCGCCGGCAT CGACTACGAG CGGGAGAGGC TCGAACTGAC CTTCGCCATC CATCCCGGCG GACCGAAGAT CGTCGAGCAC ATCCAGGAGG ATCTGGGGCT CAGCGACGAG CAGGTGGCGA TCAGCAAGAG CGTCTTCCTG GAGAACGGCA ACATGTCCTC CGCCACGATT CCCCACATCC TCAAGCAGGT CCTCGAGGAG GTGGACGTGG GCACGCGCGT CCTGTGCCTG GGGTTCGGTC CCGGACTGAC TGTCACCGGA ATGGTGCTGG AGAAAATATG A
|
Protein sequence | MSSPHNAVLT GFTPVQLAKP VPQALTLELS AYAFARAYCI KNGVGTDDEA GFAKVYQSVK EKFDKYALSS AQIKRRQLIF FPKVSDIHFA NGHVDIAAPE HAYLKLYDMA TDPRGSDLKV RHESYAKVVD QGLERMFQDS AEAPDDLIHV TCSGYLSPSP VERMAADRGW FETTVTHSYH MGCYGAFPAI KMAHGMLSSS RFGVTPVKHR VDIVHTELLS AHNNIVDARA ENIITMTLFA DGLIKYSVLS EEELHRQGGH GLRVLAMNEH LLPDSADEMT WVPGSHQFLM TLTPMVPVVI KRHVRDFVVK LLERAGIDYE RERLELTFAI HPGGPKIVEH IQEDLGLSDE QVAISKSVFL ENGNMSSATI PHILKQVLEE VDVGTRVLCL GFGPGLTVTG MVLEKI
|
| |