Gene Information |
Locus tag | Avin_29530 |
Symbol | arsC |
ID | 7761855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3043656 |
End bp | 3044885 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643805827 |
Product | Type III polyketide synthase |
Protein accession | YP_002800095 |
Protein GI | 226945022 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3424] Predicted naringenin-chalcone synthase |
TIGRFAM ID | |
|
|
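The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3043656, End bp 3044885.
length = gene_length(3043656, 3044885)
print(length, protein_length(length))  # prints: 1230 409
```

Both results agree with the Gene Length (1230 bp) and Protein Length (409 aa) fields.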
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACA TGGCCCACCC CAACAGCGCG GTCCTGGCCG ATTTCATCCC GGTCCAACTG GCCAAGCCGG TGCCCCAGCG CATCACCCTC GAACTGACCG CCTATGGTTT CGCCAGGGCC CACTGCCTGA GCAACGGCAT CACCGACGAG GAGGGCTTCG TCCAGGTCTA CAAGACGGTG AAGGAGAAAT TCGACAAGTA CGCCGTGTCG CCCGCGCAGA TCAAGCAGCG GCAACTGGTC TATTTCCCGA AGCTGACCGA CATCCGCTTC GGCGACGGCA ACTTCGACAT CGCCGACCCG GAGCCGGACC AGGCCCACCT CAGGCTGTTC GACATCAAGA AGGACCCGCG CGGAGCCGAT CTGAAGACCC GTCACGAGAG TTACGCCAAG GTCGTCGGCA AGGGGCTGGA GCAGATGTTC GAAGGCACCC TCGAGGCGCC CGACGATCTG ATCCACGTGA CCTGTTCGGG TTATCTGGCG CCGAGTCCGG CCGAGCGCAT GGTGGCCGAC CGCGGCTGGT TCGAGACCAC GGTGACCCAC AGCTACAACA TGGGCTGCTA CGGCGCCTTT CCGGCGATCA AGATGGCCCA CGGCATGCTC GCCTCGGCGC AGTGGGGGGC CACTCCGCCG AAGACCCGGG TGGACATCGC GCATACCGAG CTGATGTCCG CGCACAACAA TATCGCCGAG TCGCGGGTGG ACAACATCAT TTCGGCGACC CTGTTCTCCG ACGGGCTGAT CAAGTACTCG GTCTACCCCG AGGACGAACT GCGCCGCCAG GGGCTGCGCG GCCTGCGCAT CCTGGCGATG AGCGAGCACC TGCTGCCGGA CTCGGCCGAC ACCATGACCG GGGTGCCGGG CTCGCACCAG TTCGTCATGA CCCTCTCACC CCTGGTGCCG GCGATCATCA AGCGCCATGT GCGGGCCTTC GCGGTGGACC TGCTGCGGCG CGCCGGCATG GACTTCGAGC GCGACAAGGA CGCCCTGAGC TTCGCCATCC ACCCCGGCGG ACCGAAGATC GTCGACCACG TCCAGGAGGA ACTCGGCCTG GCCGAGGACC AGGTGGCGAT CAGCAAGAGC GTATTCCTGG AGAACGGCAA CATGTCCTCT TCCACGATTC CGCACATCCT CAAGGCGTAT CTGGAAGAGG CCACCGTCGG CACCCGTATC GCATGTCTCG GCTTCGGGCC GGGGCTGACC GCGGCCGGAC TGGTTCTGGA GAAGATATGA
|
Protein sequence | MNDMAHPNSA VLADFIPVQL AKPVPQRITL ELTAYGFARA HCLSNGITDE EGFVQVYKTV KEKFDKYAVS PAQIKQRQLV YFPKLTDIRF GDGNFDIADP EPDQAHLRLF DIKKDPRGAD LKTRHESYAK VVGKGLEQMF EGTLEAPDDL IHVTCSGYLA PSPAERMVAD RGWFETTVTH SYNMGCYGAF PAIKMAHGML ASAQWGATPP KTRVDIAHTE LMSAHNNIAE SRVDNIISAT LFSDGLIKYS VYPEDELRRQ GLRGLRILAM SEHLLPDSAD TMTGVPGSHQ FVMTLSPLVP AIIKRHVRAF AVDLLRRAGM DFERDKDALS FAIHPGGPKI VDHVQEELGL AEDQVAISKS VFLENGNMSS STIPHILKAY LEEATVGTRI ACLGFGPGLT AAGLVLEKI
|
| |
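The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how such text might be cleaned, its GC fraction computed, and the gene translated for comparison with the protein field. The helper names are hypothetical. The sketch assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand) and that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("ATGAACGACA TGGCCCACCC CAACAGCGCG")
print(translate(prefix))  # prints: MNDMAHPNSA
print(round(gc_fraction(prefix), 2))
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence through the same functions would allow the Protein sequence and GC content fields to be checked in the same way.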