Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4746 |
Symbol | |
ID | 3679633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5959995 |
End bp | 5962946 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637720102 |
Product | amino acid adenylation |
Protein accession | YP_325238 |
Protein GI | 75910942 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTCA CTCTAGAAAT ACCCTATTAT CAAATTATTC AGCAAGCGCA ATTAACACCG GATGCGATCG CTGTTTTTGA TGAAACCAAT CCCCTGACTT ATCAACAGTT AGATCGTCTC TCGAATCAAG TGGCTACTTA TTTACGCACC CAAGGAGTCG CCCCCAATAC GAGAGTCGGG ATTATGGCTG AACGGGGAGC TAGAATGATG ATTGGCATTT TAGGCATTCT CAAAGCTGGC GGGAGTTATA TCCCCCTTGA CCCTGGTTAT CCCACTGACC GACTGCGTTA CATCCTAGAA CACGCAACTA TCCAGACTTT ACTTACAGAA CATCAAGTTA GTCCACAATT GGTGGCTTGT GTTCAGGAGG CGCTACCTTT ACAAACGGTG ATGTTTTTAG ATGAGGGTGA ACCGTTAGAG GAAATTCAAT CATTAAATCA AATTACTGCG GAAACTTGGC AAGGCTTGAG TAAGGACACT TTAGAACTGT GTAACAGTCC TGACGATTTA ATGGTGATTC TCTACACCTC TGGCTCAACG GGTCGTCCCA AAGGGGTAAT GCTGAATCAT CGAGGTTACA TGAATCGTCT GACATGGATG CAGAATACTT TTTCATTACA ACCAGGCGAT CGCGTAGCCC AAAGAACCTC TTTTTGCTTT GATATCTCGG TTTGGGAAAT TTTCTGGACA TTGATGTCAG GGGCGACTAT TTGTCCTGTG CAACGGGAAG TAGTTCTCAA TCCTTGGGAA TTTGCTCAGT GGATTCAACA AACTCAAATT AATGTGATGC ACTTTGTTCC TTCCCTGTTT GGGGAGTTTA TTAGTGCTTT AGAAAATGAA ACTTGGAGTT TTCCCCAATT ACGTTGGTTG ATGTTTAGTG GGGAAGCATT ACCTATGTCC TTTATCCAGA GGTGGATTGA CCGTCATGGA CTCAAAACCG GGCTAGCCAA TCTTTACGGC CCTACAGAAG CTTCCATTGA CGTTACTTGT CACTTAATTA CAGAACGTCC TGATGAACGG GTGACGACGC AAATTCCCAT TGGTAAAGCT ATCGATAATG TGTATTTAAG AGTTCTGGAC GAGGGGATGC AGCCTGTACA ACCAGGAAAC ACGGGAGAAC TCTGGCTTGG TGGAGTCCAA TTGGCATTAG GCTATCTCAA AGATCCAGAG AAAACGGCTC AGGCTTTTTG TCCTAACCCC TTTACTGATA TTCCTGGCGA TTATATTTAT CGGACAGGAG ATTTGGTCAA AGAACTGTCA GACGGAACCA TTGAATATCA TGGACGCATA GATCATCAAG TTAAAATTCG CGGTTTCCGC ATTGAATTGG GAGAAATCGA AAGTGTGTTA ACTACTCATC CCGATGTCCG GGAAGCCGCA GTTCTAGCCG TTGATTATGG GGAAGGACAA AAGCGATTAG TTGCTTGTAT ATCGGGAAAT AAAATCAAAA ATCGTTTCCT CAAGGAATAT CTAGAACAGA AACTTCCTCA CTACATGATT CCCCAAAGGT TTCTGTGGCT GGATAGTCTA CCCAAAAATC ATAATGGGAA ATTAGACCGC AAAACATTAG TGGCTCAATT AACCTCTGAT TCCTCTCCTT CCTCTGATTC CTCTCCTTCT TTGCTACCTT TATTACCTCT AGGGCCAGCT CAAAGTTGGT TAGTCAAATA TTTTGAGCCT CCTTATCAGT GGTTGGGTTA TACCCGTTTT CTCTATCATC AATCCCTTAA TGTAGATATC TTCAATCAAG CGGTTAATTT CCTTATTCAT CGTCATGACG CTTTCCGCAC GGTTTTTGTG CAGAACCAGG GACAATGGCA ACAACATATT CTCAATCAAA CGCCATCCTT TGTTTCTGAA TATTGGGATG GTAGTCAGGT AAGTCCAGAA GAACGGAATC TGAAAATTGA GCAATACATG GAAGTGGTAG CTAAACAAAT TCAGCTTGAC CGTTGGCCAA TGACGGCAGC GATCGTCGTC AAGGTGAGTC CTACTTGTTA CGATATTACG ATGATTGCTC ACCATATGAT TACTGATATG TTGAGCGCGA CAATTTTATT TAAAGAACTG TGGAGTGCTT ATCATCAACT TTTATTGGGT GTCTCTCCCA CATTTCCCCA GGCTAAATCC GAATCCTATG AAGATTTTAT TCAATTATTA GTGAATGAGG ATAATAAAGG AACCTTTGAA CATCATATTA AATATTGGCA ATCTCAATTT CCTGATACTA CAAGCAGATT TGAGATTCCT ATCGACCATG TGAAAGGAGC CAATATTGAA GCCTCGGCGG AAAAAGAATG TTTTCGTCTT AGTCAACAAC AGACAGAACT TTTATTAAAT CAAGGAAAAT CTTATTATGG CTCTAATGTT TACTTGATTC TCCTCGCGCC TTTATATCGT TTAATGGCTA ATTGGAGTCA TCGCCCAGGG GTGGTAATTA GTCATCGTAG TCATGGCAGA GATTTAGGAC AAGGTAGTAC TTTTATGGAA AGTATGGGGA ATTATGCAAT TAATTTCCCT CTAGGTATAA CTTTGGACTC ATCTGCGATT TGGCAAAAAA TTATTCAAGA GATTAGCGAT CGCTTTCAAA GTTTACCTAT GAATGGTGTA ACCTATGATT GGATAGGCGA TCGCTTACCT GAATACCTCT ACCCTGACAG CAATTTAACT TCTATACGGG TCAATTATTT AGGGAATCGT TCTGCTCCTC CGTCTAACCT CTTTGAGTTT ATACAGACTG TCCGAGACTG TCGTTTGTCA CTCCCCGATC AAAAACGTAC CACACTCATT GAATTTTTTC TTTTGGTAGC TGATGGTCAT TTAGAGATCA CAATTGAATA CTCTCGTAAT TTTCATCTCG CTACAACAAT TCAATCCTTA GGAAAACAAT ATCTCCTATT GCTTGAGGAT ATCTTGACTC CTGTCTCTAT GTCTCAGCCT CCCCTGCGCT AA
|
Protein sequence | MTFTLEIPYY QIIQQAQLTP DAIAVFDETN PLTYQQLDRL SNQVATYLRT QGVAPNTRVG IMAERGARMM IGILGILKAG GSYIPLDPGY PTDRLRYILE HATIQTLLTE HQVSPQLVAC VQEALPLQTV MFLDEGEPLE EIQSLNQITA ETWQGLSKDT LELCNSPDDL MVILYTSGST GRPKGVMLNH RGYMNRLTWM QNTFSLQPGD RVAQRTSFCF DISVWEIFWT LMSGATICPV QREVVLNPWE FAQWIQQTQI NVMHFVPSLF GEFISALENE TWSFPQLRWL MFSGEALPMS FIQRWIDRHG LKTGLANLYG PTEASIDVTC HLITERPDER VTTQIPIGKA IDNVYLRVLD EGMQPVQPGN TGELWLGGVQ LALGYLKDPE KTAQAFCPNP FTDIPGDYIY RTGDLVKELS DGTIEYHGRI DHQVKIRGFR IELGEIESVL TTHPDVREAA VLAVDYGEGQ KRLVACISGN KIKNRFLKEY LEQKLPHYMI PQRFLWLDSL PKNHNGKLDR KTLVAQLTSD SSPSSDSSPS LLPLLPLGPA QSWLVKYFEP PYQWLGYTRF LYHQSLNVDI FNQAVNFLIH RHDAFRTVFV QNQGQWQQHI LNQTPSFVSE YWDGSQVSPE ERNLKIEQYM EVVAKQIQLD RWPMTAAIVV KVSPTCYDIT MIAHHMITDM LSATILFKEL WSAYHQLLLG VSPTFPQAKS ESYEDFIQLL VNEDNKGTFE HHIKYWQSQF PDTTSRFEIP IDHVKGANIE ASAEKECFRL SQQQTELLLN QGKSYYGSNV YLILLAPLYR LMANWSHRPG VVISHRSHGR DLGQGSTFME SMGNYAINFP LGITLDSSAI WQKIIQEISD RFQSLPMNGV TYDWIGDRLP EYLYPDSNLT SIRVNYLGNR SAPPSNLFEF IQTVRDCRLS LPDQKRTTLI EFFLLVADGH LEITIEYSRN FHLATTIQSL GKQYLLLLED ILTPVSMSQP PLR
|
| |