Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_09690 |
Symbol | |
ID | 7759913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 918662 |
End bp | 922828 |
Gene Length | 4167 bp |
Protein Length | 1388 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643803874 |
Product | Non-ribosomal peptide synthetase, with condensation, AMP binding and thioesterase modules |
Protein accession | YP_002798176 |
Protein GI | 226943103 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGGCT TGTCTGCACT AATCGAATTA GCGGAAAAAG GATGGGCATT CGAGCGGGTC AGTGCCGAGC GCTTGGATAT TTGTCGGTCG ACTAATAATG AAGGCGAGGC ACTGTGGCAG CATAGCAAAG CGCAGATTTT TGAGTTTTTG AGTGACACCG AAAAAAGTCA AGATGAGCGG ACAGGTATCC GTTCGGTCAG GCCGGGCGAA ACGTTTGTGG TAAACCGTTA TGCTGGCCAG TATCTATGGG TGTTCGATAA CCTCAGACAC CATAGTGCGC TCTACAATAT TCCGTTGTGC AAGCGGATAA GCGGGTCAGT GGATGTAGAG CGCTTAATTT ACGCGTTGTC TCGACTGGTG CAGCACCATG GCATCTTCGC TACGGTCTAT CAGTTGCAGG AGGCACAGTT GCATATGTCT ACTAGGGTAT TACCTACCAG TTTGTTTGCT GACGCCCTGA CCGACATTTC GGGACTGAGT ACTTCACAAC AGCAGCAATG GTTGGAAAAC TTCACCTTGG AATGCAGCCA GTGTGCGTTC GATCTCAGTT GTGAGGTTCC AATCCGCTGT CGTATCGTTC GTACTAGAGA TGACGAGCAT TGGTTGTTCC TGACGTTCCA TCATTCGGCG GTCGATGGCT GGTCGATTGG TCAGTTCCTC AACGAGTTGA GCGCCACCTA TCAAGATCTC TCTTTCGTAC CGGACTGTTT ACCCGCTGCT TGGAGTTACG CTCAGAATCC GTTTGCCTTC ATAGCCTCGG TCGACGAAAG CTTGGCATTT TGGATGAAGG AACTTGAGCA TGCTCCGGCT CGCCACGGCC TAGCCTATGA TGCTGTAGAG AAGCATACTG ATTTACACAA AAACGTAGTG ACTTCGATAC TTTCCGAGGC CTCTTTGCAA TCCTTAAAAG GCTTTTCCCG GGCCCATGGG GTTTCGATCT CGGTTGTACT GCAAGCCACC TTTGTTCTAC TGCTATCGAA GCTGAGTCGG CAACGTCGTA TTGTAATCGG TACTCCAGTG GCCAATCGCC CCCATGTTGA ACTTAACACT GCATTGGGTA GTTTTGTAAA CACTATTGCA CTTTATTTTA ATCTAGAGGA AGTACAAAGC TTTTCTGATC TTGTGCAAGT AGCTCAAGAT AAATTAATAC GCTCGCATCC GCACCATGGT TTGCCATTCT CCTACGTGGT TGAGCAGCTT CGTCCAGAAC GTGGTGATTT CAATCCAATT TACCAGATTC TTTTCGTTTG CCAACACCAG CAGGATAGCC ACCTGCAGTT TGTGGATGCA CAAGTTCATG ATGTACAACG AACCTACTCA ACCCCTAAAA GTGACCTTGC ACTCGAGGTC ATCCTGCACC AGGATAGGAT AACGTTGGAG TGGCAGTTCC ATCCCAACTA CTTCTCCTTT GACAGGATTG CGTCGTTTGC CCAGCATTTT CTCAACTTGC TTGAGCAGGT TATGAAGGCT CCCTCACTAC CGCTAGGTTG CTTCGGTCTC ATGTCTCCGA AGCGAAGGAA GACCTTACTC GAACTGTCGA TGGGGGACCA GGTTGATGTG TTTGTTGGAC AAACACTCGA TCGACTGATG GAGCAGTCAA AATCACGTTT CGCCGACAAT ATTGCGGTGA TCGATGGTAA TGATCAATAT ACCTATGCGC AGCTGTTTGC CGGTGCTAAG AGCCTAGCCG GATATCTTGA CTCGCTCTGC GAGCCACAGG CGATTGTCGC TGTCCAGATC AAGCGGGGCT ATCTGCAGGC CCTGCTGTTG CTTGCTACCG TGCTTAGTGG GCGAGTTTAC CTTCCGCTGG CTATGGACAC GCCGATATCG CGTGCCCGCA GTATCCTTGA GTCATCAGGT TGCACCCTGC TGATCGGCGA TGTGCTCGAT GAACAGATGT ACCCAGGTAT TCGGGTATTG CCCTCGCGGA TACTGTGGTG CCAGCTTGAG CATGCTCCAT TTACCCGTGA TACATCGATC AGCCCCTCTG ATTTAGCATA TTTGATCTAC ACCTCGGGAA CCACAGGGAC ACCCAAGGGC GTCGCCATCG AACATGCGGC GGTGTGCAAT ACACTATTGG CAATGAACCA ATATTTCGGC GTGTCGCAAC ACGACAGGGT TTTGGCCATT TCCAACATAA GTTTCGACTT ATCTGTATAT GACCTGTTCG GTACTTGGAC TGCTGGTGCC TGCGTGGTCC TGTTGTCCGA GTCGGCAAGT AAGGATCCTG CCAGTTGGGT ACAGGCCATT CATGCGAACC ACGTCAGCGT ATGGAACTCA GTGCCCATGG TACTACAAAT GATGCTCGCT TTCGTTCAGG GGTTGAGACT GAATACCTTT CCGGGAGTCC GACACATTTG GTTGAGTGGC GATTGGATTC CGCCGAAGCT AATCGAACAG GCTAGATGCT GCTTTCCACA GGCCAAAATC ATTAGCTTGG GCGGAGCTAC CGAAGGCTCC ATCTGGTCGA TCTACCATCC ACTGCAAGAC CAAGTCTACC TTGGCAGTAT CCCCTACGGT AGAGCATTGC CCAACCAAGG TATGTTCGTC CTAGATGAAC AACTGGAGCT TTGTGATTTC GGGGTCAGTG GCGATATTTA CATTGCTGGT TACGGTGTCG CTCGAGGTTA TCACCAAGCC CCCCGGTTGA CCGAGAGTAA ATTTACCGTC CATCCCCAAC TGAAGCAGCG CCTATACAAG ACCGGTGATC GTGGACGTTG GCATACAGCG GGTTACATCG AGTTTCTGGG ACGGGAGGAC AAGCAGGTCA AGATCCAAGG GTACAGGGTC GAACTTGGCG AAGTCGAATC TGTGCTAAAA CGCGCTAGTT TCGTACGTGA TGCAGTTGTG TTGATTCGAT CCTCAACGGG GGGAGGCGGT TCCTACCTAG AGGCCCATAT CGTTGCGTCG CCGTTGACAG CACAGTTAGA GCCGACGCTT CGAGCACATG CGGCATTGCT GCTAAGCCCT TACATGCAAC CGTTGCACTA TGGTTTTTAT GAGCAATTCC CGTTGTCGGC CAATGGCAAG GTGGATACTA GCCGCTTGAG GAGGCTGGCA CCGATGCGTG CGTCATCCAC TTCTGGATGG GATGCGGATG AACACCTTTT CAAGCTCATG GAAATTGTTT CTGTGGTTCT AGACAGACCA ATGGCGGAGC TCGATCCACA GCACAGCTTC TATCAGTTGG GCGGCACATC TCTGCAGGCG GTTAGTCTGG CAGTCAAAGG AGCCACGCAT TGGCAAGTTA ACCTATCGAT CACGGACATC TTGGAGGCGT CCTCATTGGT CGAGCTGGCT GAGAAGATCC GTGTGGTACC TCTGGTTCTG TCGAAGCTAT CGACTTTCGA GCAGGCTTCT GTTGGCGCCT TGAGTTTGTG CTTCGTGCAT GCTGCCGGAG GGCATCTCGA GCCATATCGC ACACTCCAGA CACGCTATGC GGGGCGTTGC AATCTGTTCG GATTGAGCAG CCCTGCATTG GCCAGCATCG GTCCTGACTG CGAACTGGCT TTTGAAGCGT TGCTCGAAGC TCATGTCAAT TCGTTGCTGG ACATTCCCAG TCAAGGGCAC TTGGTGCTGG TTGGCTGGTC ATTGGGTGGC ATTCTAGCGA TGAACCTGTG TGAACGCTTG GTGCGCCATG GCATCAGAGT TAGACATGTA GTAGTAGTAG ACTCGGGTTT GGATTATGCG CCAATGGTTG GCAAGGGGCG CACCAAGCAG TGGTTGAGAC TGGTCGCCAA TGCGGTGGAG ATGTACGGAC TCGACAAGCA CTGCCTGATA GTGAAAGGGG CTGAGGGTTT CAGTTCGTTC GCTGGGCTGC TTAAGTACCT CTATACCGTC AATGAGGAGA AACTGAGCGC TGTGCTCTCA AGATCCCAGT TTGAAGTCAG CGCCTGGGCT TTGGAGCGAG CATGTCGGTT ATTGGCCGAG GCGAGTATCC CGAAGCTCGA TACGGCTTTG AGTGTTCACT TGTGCCAGAC CCATCAGACT ATTGCTTCAT CGCAGATTCT GCGTTGGCAG GATCTGAGTC CTAAGGTCGA TATCATCGAG CTGGCTGCTG AGCATAATAG TATCCTTAGT GTACCGTCCT TTCTGCGCTC TTTGGACAGC TTGCTCGAAA GTTTACAGGT TTCGGAAGGT AAAGCAATTG ACACGGCGAA CGGGTAG
|
Protein sequence | MDGLSALIEL AEKGWAFERV SAERLDICRS TNNEGEALWQ HSKAQIFEFL SDTEKSQDER TGIRSVRPGE TFVVNRYAGQ YLWVFDNLRH HSALYNIPLC KRISGSVDVE RLIYALSRLV QHHGIFATVY QLQEAQLHMS TRVLPTSLFA DALTDISGLS TSQQQQWLEN FTLECSQCAF DLSCEVPIRC RIVRTRDDEH WLFLTFHHSA VDGWSIGQFL NELSATYQDL SFVPDCLPAA WSYAQNPFAF IASVDESLAF WMKELEHAPA RHGLAYDAVE KHTDLHKNVV TSILSEASLQ SLKGFSRAHG VSISVVLQAT FVLLLSKLSR QRRIVIGTPV ANRPHVELNT ALGSFVNTIA LYFNLEEVQS FSDLVQVAQD KLIRSHPHHG LPFSYVVEQL RPERGDFNPI YQILFVCQHQ QDSHLQFVDA QVHDVQRTYS TPKSDLALEV ILHQDRITLE WQFHPNYFSF DRIASFAQHF LNLLEQVMKA PSLPLGCFGL MSPKRRKTLL ELSMGDQVDV FVGQTLDRLM EQSKSRFADN IAVIDGNDQY TYAQLFAGAK SLAGYLDSLC EPQAIVAVQI KRGYLQALLL LATVLSGRVY LPLAMDTPIS RARSILESSG CTLLIGDVLD EQMYPGIRVL PSRILWCQLE HAPFTRDTSI SPSDLAYLIY TSGTTGTPKG VAIEHAAVCN TLLAMNQYFG VSQHDRVLAI SNISFDLSVY DLFGTWTAGA CVVLLSESAS KDPASWVQAI HANHVSVWNS VPMVLQMMLA FVQGLRLNTF PGVRHIWLSG DWIPPKLIEQ ARCCFPQAKI ISLGGATEGS IWSIYHPLQD QVYLGSIPYG RALPNQGMFV LDEQLELCDF GVSGDIYIAG YGVARGYHQA PRLTESKFTV HPQLKQRLYK TGDRGRWHTA GYIEFLGRED KQVKIQGYRV ELGEVESVLK RASFVRDAVV LIRSSTGGGG SYLEAHIVAS PLTAQLEPTL RAHAALLLSP YMQPLHYGFY EQFPLSANGK VDTSRLRRLA PMRASSTSGW DADEHLFKLM EIVSVVLDRP MAELDPQHSF YQLGGTSLQA VSLAVKGATH WQVNLSITDI LEASSLVELA EKIRVVPLVL SKLSTFEQAS VGALSLCFVH AAGGHLEPYR TLQTRYAGRC NLFGLSSPAL ASIGPDCELA FEALLEAHVN SLLDIPSQGH LVLVGWSLGG ILAMNLCERL VRHGIRVRHV VVVDSGLDYA PMVGKGRTKQ WLRLVANAVE MYGLDKHCLI VKGAEGFSSF AGLLKYLYTV NEEKLSAVLS RSQFEVSAWA LERACRLLAE ASIPKLDTAL SVHLCQTHQT IASSQILRWQ DLSPKVDIIE LAAEHNSILS VPSFLRSLDS LLESLQVSEG KAIDTANG
|
| |