Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_21160 |
Symbol | |
ID | 7761041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2108848 |
End bp | 2110146 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643805011 |
Product | Condensation domain protein |
Protein accession | YP_002799292 |
Protein GI | 226944219 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCA CGCCTTCCCG CTTGCCCGCC ACCACCGCCC AGTTCAGCAT CTGGGTCGCC CAGCAGGCCG TACCCGACAG TCCGGCCTAC CTCACGGCCG AAACCATCGA GCTGCTCGGC GCCGTCGATC GCGCGGCGCT GTCGGACACG GTGGTCGAGG TCCTGAACAA CTGCCACAGC CTGCACATGC GCTTCGTCCA CGAGGAAGGC CGGCTGTGGC AATTGCCGCA GCCGCCGGCC TGCCCGCCGC TGGCGCTGCA CGACTTCCGC CGCGAGCCCG ACCCGGCCGC CGCGGCCCAG GCCTGGATCG ACCGGGAACT GAGCCGCACC CTCGACCCGA GCCGCGACGT GCTCTACCGG AGCGCCCTGC TGCAACTGGG CGACCAGCAC CATCTCTGGT ACCTGCAGGC CCACCATGTG ACGCTGGACG GCTACGCCTA CGGCCTGGTC TGCCAGACCA TCGCCGAGCG CTACAGCGCC AGGGTCCGGC AAAGCGAGCT GCCGCCCCTG CCGGACTGGT CGATGGCGCG GGTGGTGCAG GCCGAGCAGG ATTACCGGGC CAAGGGACTG TTCGAGCGCG ACAAGCAGTT CTGGCTGCAG CAGCTGCGGG ATGTCCCGGC ACCGGCCACC ATCGCCGAGG CGGCGGAGTT TCCCGAGCAG GTCATTCGCA GCGAAACCCG GCTGCAACGC GACCAGGTGG CCATGCTGCA GGAGGCGGCC CGCGCCTGCG GGCAGGACTG GGGCAACTGG GTGCTGGCGG CCATCGGACT CTGGCTGGGC AGGCAATCCG GCCAGCAGGC GCTGACCTTC GCCCTGCCGG CGATGAACCG GCTGGGCACC CCGGCGCTGG GCGTACCCTG CATGGCGATG AACATAGTGC CGCTGAGCCT GCGCATCGAC CCGGCCGGCT CGATGGCCAG CCATGCGCAG CAGATCGCCG GCGAGATGCG CCGCATCCGC CCGCACCTGT ATTACCGCTA CGGCTGGATG CGCATGGACC TGGGCCTGAT GGAGGCGCAG AAGCACCTGT TCAACCAGGC GGTCAACCTC ATGCCCTTCG ACCGCAAGGC CGCCTTCGCC GGGCTGGACA GCCGCATCAG GCCCATCACC GCCGGACCGG TCAAGGACCT CAACGTCACC CTGGCGGTAC TGGACGCCGA GTGGCGGCTC TGCGTCGAAG CCAATCCCCA CGCCTACTCC AGCGACCGAC TGGAGCAATT GCACGGGGAC CTGCTGGACT GGCTGCAACG GCTGGCCAGG CACGCCCCGG AGGCGCCCCT GCAGGCGCTG TGGCAATAA
|
Protein sequence | MNATPSRLPA TTAQFSIWVA QQAVPDSPAY LTAETIELLG AVDRAALSDT VVEVLNNCHS LHMRFVHEEG RLWQLPQPPA CPPLALHDFR REPDPAAAAQ AWIDRELSRT LDPSRDVLYR SALLQLGDQH HLWYLQAHHV TLDGYAYGLV CQTIAERYSA RVRQSELPPL PDWSMARVVQ AEQDYRAKGL FERDKQFWLQ QLRDVPAPAT IAEAAEFPEQ VIRSETRLQR DQVAMLQEAA RACGQDWGNW VLAAIGLWLG RQSGQQALTF ALPAMNRLGT PALGVPCMAM NIVPLSLRID PAGSMASHAQ QIAGEMRRIR PHLYYRYGWM RMDLGLMEAQ KHLFNQAVNL MPFDRKAAFA GLDSRIRPIT AGPVKDLNVT LAVLDAEWRL CVEANPHAYS SDRLEQLHGD LLDWLQRLAR HAPEAPLQAL WQ
|
| |