Gene Information |
Locus tag | Ava_4833 |
Symbol | |
ID | 3679409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 6074194 |
End bp | 6076077 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637720190 |
Product | amino acid adenylation |
Protein accession | YP_325325 |
Protein GI | 75911029 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
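The length fields above can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(6074194, 6076077)
print(length)                     # 1884
print(protein_length_aa(length))  # 627
```

Both values agree with the Gene Length and Protein Length rows of the record.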
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.954107 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0358179 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAC AGTTAAGGAA ATATCCCCAC AACCAGTGCA TTCATCAGTT ATTTGAAGAA CAAGTAGAAC GCACACCTGA TGCAGTGGCT GTTGTTTTTG GTAAACAACA TTTGACTTAC CAACAATTAA ATCACCGAGC CAATCAATTA GCGCAATATC TGCGAACCTT GGGTATAGGA GCAGAAATGC TAGTAGGAAT TTGTCTAGAG CGATCGCCAG AAATGATCAT TGGATTGTTA GCAATCCTCA AAGCTGGGGG AGCTTATGTA CCTTTAGATG CAGGATATCC ACAAGAACGC CTAGCTTTCA TGCTAGTAGA TACCCAAATC CCAGTATTAT TAACCCAAAA AGAATTAGTC AAAAAATTAC CTAATCATGA GGCGCGCGTA ATTTGCCTTG ATACTGATTG GGAAATTATC AATCAACACA CACCAGAAAA CCAAAATATT AGCATCACAC CTGATAATTT GGCTTATGTC ATGTATACCT CAGGTTCTAC AGGACAACCC AAAGGTGTGA GTGTTGTTCA TCGTGGTGTA GTCCGCTTAG TCAAACAAAC TAACTACGCT AACTTTACTA ATACAGAAAT ATTTTTACAA TTTGCGCCCA TATCTTTTGA TGCTTCCACC TTTGAAATTT GGGGTTGCTT ACTCAACGGT GGAAAACTCG TTTTATATCC CAGTAACACC CCATCTATAG ATGAATTAGG ACAAGTTATT CAAAAATATC AAATCACTAC CATCTGGTTA ACAGCAGGCT TATTTCATCT CATGGTAGAT GAAAATATTC ATGCTTTAAA ACCCTTACGT CAACTGTTAG CAGGTGGTGA TGTTTTATCC GTTTCTCACG TCCAAAAATT TCTAAAAACA GTAGAAAATT GTCAACTAAT TAACGGTTAC GGCCCCACAG AAAACACCAC CTTTACCTGT TGTTATCACA TCAAAGACCC AGTCAGACCA GATAGCTCAA TTCCTATTGG TCGCCCCATC GCTCATACCC AGGTATACAT ATTAGATGAA AATTTGCAAC CAGTAGCAAT GGGAGCAACA GGAGAATTAT ATATTGGTGG CGACGGCTTG GCACGTGGTT ATCTCCATCG TCCAGAATTA ACCAAAGAAA GATTTATTGA ATTAAATAAC TCAAACTTTC AATCCCTAAC TCTATATAAA ACAGGGGATT TAGCTCGTTA TTTACCAGAT GGCAATATTG AATTTCTGGG ACGAATTGAC AACCAAGTGA AAATTCGAGG CTTCCGCATT GAGTTAGGAG AAATTGAGCG AGAGATTTCT CAATATCCTG ATGTGCGAGA AAACGTCGTT TTGGCTCATC AGACGGCAAC AGGCGAAAAG AGATTGGTAG CTTATATTGT GCTACATCAA AGCAGTTCAT ATAAACAAGA ACAATTACGT AATTTCCTCC AGCAGCGATT ACCAGACTAT ATGTTGCCAT CGGCATTTAT GGTTTTAGAA TCATTGCCGT TAACTGCTAA TGGCAAAGTA GATAGACATA AACTTCCCAC TCCCAGCAAA GAACGTCCCC AACTGGAACA AGTATATATT GCACCCCAAA CTGATTTACA ACGGCAGTTA ACCAATATTT GGTCTGATGT TTTGAATATT GAGCCAGTGG GTATTGATGA CAACTTCTTT GATTTGGGTG CAACTTCGAC CTTAATTATG CAGATAGCTG TGCGAGTACA ACAACAACTA GGAATTGAGC TATCGGTGGT GAAACTGTTT CAGTATCCTA CAATCGCTGG CTTGGAAAAA TATTTAAATG TAGAGCAGAA CACTCAACAA 
TCCTATAACC AGCTGCAAAG TCGCGCTCAA CGCCAGCAAG CAGCAGCTTC TGCTCGTCGT CGTCATAGTC AACGGGGGGT TTAA
|
Protein sequence | MEKQLRKYPH NQCIHQLFEE QVERTPDAVA VVFGKQHLTY QQLNHRANQL AQYLRTLGIG AEMLVGICLE RSPEMIIGLL AILKAGGAYV PLDAGYPQER LAFMLVDTQI PVLLTQKELV KKLPNHEARV ICLDTDWEII NQHTPENQNI SITPDNLAYV MYTSGSTGQP KGVSVVHRGV VRLVKQTNYA NFTNTEIFLQ FAPISFDAST FEIWGCLLNG GKLVLYPSNT PSIDELGQVI QKYQITTIWL TAGLFHLMVD ENIHALKPLR QLLAGGDVLS VSHVQKFLKT VENCQLINGY GPTENTTFTC CYHIKDPVRP DSSIPIGRPI AHTQVYILDE NLQPVAMGAT GELYIGGDGL ARGYLHRPEL TKERFIELNN SNFQSLTLYK TGDLARYLPD GNIEFLGRID NQVKIRGFRI ELGEIEREIS QYPDVRENVV LAHQTATGEK RLVAYIVLHQ SSSYKQEQLR NFLQQRLPDY MLPSAFMVLE SLPLTANGKV DRHKLPTPSK ERPQLEQVYI APQTDLQRQL TNIWSDVLNI EPVGIDDNFF DLGATSTLIM QIAVRVQQQL GIELSVVKLF QYPTIAGLEK YLNVEQNTQQ SYNQLQSRAQ RQQAAASARR RHSQRGV
|
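As a rough consistency check between the two sequences, the sketch below translates short excerpts of the gene sequence and compares them with the ends of the protein sequence. It assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The `translate` helper is illustrative only, and just the first three and last three blocks of the gene sequence are used.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Excerpts copied from the Gene sequence row: first 30 bp and last 24 bp.
first_blocks = "ATGGAAAAAC AGTTAAGGAA ATATCCCCAC"
last_blocks = "CGTCATAGTC AACGGGGGGT TTAA"

print(translate(first_blocks))  # MEKQLRKYPH
print(translate(last_blocks))   # RHSQRGV
```

The two results match the first ten and last seven residues of the protein sequence. The last 24 bp are in frame because the 1860 bp preceding them are a multiple of three.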
| |