Gene Information |
Locus tag | Ava_3855 |
Symbol | |
ID | 3678495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 4798900 |
End bp | 4801566 |
Gene Length | 2667 bp |
Protein Length | 888 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637719207 |
Product | amino acid adenylation |
Protein accession | YP_324355 |
Protein GI | 75910059 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins; [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
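
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4798900
END_BP = 4801566


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon.

    Assumes the CDS is complete and ends with exactly one stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "2667 888"
```

Both values agree with the Gene Length and Protein Length fields of the record. The strand field (`-`) does not affect this arithmetic; it only determines which strand of the replicon is read.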
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACTA TAGATTTTAA TATTCGTAAG TTACTTGTAG AGTGGAACGC GACCCACAGA GATTATGATC TTTCCCAGAG TTTACATGAA CTAATTGTAG CTCAAGTAGA ACGAACACCT GAGGCGATCG CTGTCACCTT TGACAAGCAA CAACTAACTT ATCAAGAACT AAATCATAAA GCAAACCAGC TAGGACATTA TTTACAAACA TTAGGAGTCC AGCCAGAAAC CCTGGTAGGC GTTTGTTTAG AACGTTCCTT AGAAATGGTT ATCTGTCTTT TAGGAATCCT CAAAGCTGGG GGTGCTTATG TTCCTATTGA CCCTGAATAT CCTCAAGAAC GCATAGCTTA TATGCTAGAA GATTCTCAGG TGAAGGTACT ACTAACTCAA GAAAAATTAC TCAATCAAAT TCCCCACCAT CAAGCACAAA CTATCTGTGT AGATAGGGAA TGGGAGAAAA TTTCCACACA AGCTAATACC AATCCCAAAA GTAATATAAA AACGGATAAT CTTGCTTATG TAATTTACAC CTCTGGTTCC ACTGGTAAAC CAAAAGGTGC AATGAACACC CACAAAGGTA TCTGTAATCG CTTATTGTGG ATGCAGGAAG CTTATCAAAT CGATTCCACA GATAGCATTT TACAAAAAAC CCCCTTTAGT TTTGATGTTT CCGTTTGGGA GTTCTTTTGG ACTTTATTAA CTGGCGCACG TTTGGTAATA GCCAAACCAG GCGGACATAA AGATAGTGCT TACCTCATCG ATTTAATTAC TCAAGAACAA ATCACTACGT TGCATTTTGT CCCCTCAATG CTGCAAGTGT TTTTACAAAA TCGCCATGTA AGCAAATGCA GCTCTCTAAA AAGAGTTATT TGTAGCGGTG AAGCTTTATC TATAGATTTA CAAAATAGAT TTTTCCAGCA TTTGCAATGT GAATTACATA ACCTCTATGG CCCGACAGAA GCAGCAATTG ATGTCACATT TTGGCAATGT AGAAAAGATA GTAATTTAAA GAGTGTACCT ATTGGTCGTC CCATTGCTAA TACTCAAATT TATATTCTTG ATGCCGATTT ACAACCAGTA AATATTGGTG TCACTGGTGA AATTTATATT GGTGGTGTAG GGGTTGCTCG TGGTTATTTG AATAAAGAAG AATTGACCAA AGAAAAATTT ATTATTAATC CCTTTCCCAA TTCTGAGTTT AAGCGACTTT ATAAAACAGG TGATTTAGCT CGTTATTTAC CCGATGGAAA TATTGAATAT CTTGGTAGAA CAGATTATCA AGTAAAAATT CGGGGTTATA GAATTGAAAT TGGCGAGATT GAAAATGTTT TATCTTCACA CCCACAAGTC AGAGAAGCTG TAGTCATAGC GCGGGATGAT AACGCTCAAG AAAAACAAAT CATCGCTTAT ATTACCTATA ACTCCATCAA ACCTCAGCTT GATAATCTGC GTGATTTCCT AAAAGCAAGG CTACCTGATT TTATGATTCC AGCCGCTTTT GTGATGCTGG AGCATCTTCC TTTAACTCCC AGTGGTAAAG TAGACCGTAA GGCATTACCT AAGCCTGATT TATTTAATTA TAGTGAACAT AATTCCTATG TAGCGCCTCG GAATGAAGTT GAAGAAAAAT TAGTACAAAT CTGGTCGAAT ATTCTGCATT TACCTAAAGT AGGTGTGACA GAAAACTTTT TCGCTATTGG TGGTAATTCC CTCAAAGCTC TACATTTAAT TTCTCAAATT GAAGAGTTAT TTGCTAAAGA GATATCCTTA GCAACACTTT TAACAAATCC AGTAATTGCA 
GATTTAGCCA AGGTTATTCA AGCAAACAAC CAAATCCATA ATTCACCCCT AGTTCCAATT CAACCACAAG GTAAGCAGCA GCCTTTCTTT TGTATACATC CTGCTGGTGG TCATGTTTTA TGCTATTTTA AACTCGCACA ATATATAGGA ACTGACCAAC CATTTTATGG CTTACAAGCT CAAGGATTTT ATGGAGATGA AGCACCCTTG ACGCGAGTTG AAGATATGGC TAGTCTCTAC GTCAAAACTA TTAGAGAATT TCAACCCCAA GGGCCTTATC GTGTCGGGGG GTGGTCATTT GGTGGAGTCG TAGCTTATGA AGTAGCACAG CAGTTACATA GACAAGGACA AGAAGTATCT TTACTAGCAA TATTAGATTC TTACGTACCG ATTCTGCTGG ATAAACAAAA ACCCATTGAT GACGTTTATT TAGTTGGTGT TCTCTCCAGA GTTTTTGGCG GTATGTTTGG TCAAGATAAT CTAGTCACAC CTGAAGAAAT AGAAAATTTA ACTGTAGAAG AAAAAATTAA TTACATCATT GATAAAGCAC GGAGCGCTAG AATATTCCCG CCTGGTGTAG AACGTCAAAA TAATCGCCGT ATTCTTGATG TTTTGGTGGG AACTTTAAAA GCAACTTATT CCTATATAAG ACAACCATAT CCAGGAAAAG TCACTGTATT TCGAGCCAGG GAAAAACATA TTATGGCTCC TGACCCGACC TTAGTTTGGG TAGAATTATT TTCTGTAATG GCGGCTCAAG AAATTAAGAT TATTGATGTC CCTGGAAACC ATTATTCGTT TGTTCTAGAA CCCCATGTAC AGGTTTTAGC ACAGCGTTTA CAAGATTGTC TGGAAAATAA TTCATAA
|
Protein sequence | MQTIDFNIRK LLVEWNATHR DYDLSQSLHE LIVAQVERTP EAIAVTFDKQ QLTYQELNHK ANQLGHYLQT LGVQPETLVG VCLERSLEMV ICLLGILKAG GAYVPIDPEY PQERIAYMLE DSQVKVLLTQ EKLLNQIPHH QAQTICVDRE WEKISTQANT NPKSNIKTDN LAYVIYTSGS TGKPKGAMNT HKGICNRLLW MQEAYQIDST DSILQKTPFS FDVSVWEFFW TLLTGARLVI AKPGGHKDSA YLIDLITQEQ ITTLHFVPSM LQVFLQNRHV SKCSSLKRVI CSGEALSIDL QNRFFQHLQC ELHNLYGPTE AAIDVTFWQC RKDSNLKSVP IGRPIANTQI YILDADLQPV NIGVTGEIYI GGVGVARGYL NKEELTKEKF IINPFPNSEF KRLYKTGDLA RYLPDGNIEY LGRTDYQVKI RGYRIEIGEI ENVLSSHPQV REAVVIARDD NAQEKQIIAY ITYNSIKPQL DNLRDFLKAR LPDFMIPAAF VMLEHLPLTP SGKVDRKALP KPDLFNYSEH NSYVAPRNEV EEKLVQIWSN ILHLPKVGVT ENFFAIGGNS LKALHLISQI EELFAKEISL ATLLTNPVIA DLAKVIQANN QIHNSPLVPI QPQGKQQPFF CIHPAGGHVL CYFKLAQYIG TDQPFYGLQA QGFYGDEAPL TRVEDMASLY VKTIREFQPQ GPYRVGGWSF GGVVAYEVAQ QLHRQGQEVS LLAILDSYVP ILLDKQKPID DVYLVGVLSR VFGGMFGQDN LVTPEEIENL TVEEKINYII DKARSARIFP PGVERQNNRR ILDVLVGTLK ATYSYIRQPY PGKVTVFRAR EKHIMAPDPT LVWVELFSVM AAQEIKIIDV PGNHYSFVLE PHVQVLAQRL QDCLENNS
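
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the block spacing and compute a GC percentage, as a possible check against the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First two ten-base blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGCAGACTA TAGATTTTAA")
print(len(fragment), gc_percent(fragment))  # prints "20 25.0"
```

Applying the same two functions to the full gene sequence should give a value comparable to the reported GC content, subject to how the database rounds it.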
|
| |