Gene Ava_4837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4837 
Symbol 
ID3679335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6083299 
End bp6086580 
Gene Length3282 bp 
Protein Length1093 aa 
Translation table11 
GC content41% 
IMG OID637720194 
Productamino acid adenylation 
Protein accessionYP_325329 
Protein GI75911033 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0536139 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAT TACCTATAGA AAATCAAACT GAATTGACCG CTATTGATTT CGATCCATTT 
GCAGATGGTG AATTATTATT AACTGCCCCT GCTACCGAAT CTCAAAAAGA AATTTGGGCA
TCTGTACAGA TGGGGAGTGC TGCTAATTGT GCTTATAACG AATCCCAATC CCTAAACTTA
AAAGGACAAT TCAATTTAGA AGCTTTTCAA TCTGCTATCC AACAGCTAGT TCAACGTCAT
GAGGCTTTAC GAACAACCTT TAGTACTAAT GGTAGTACAC TCTGTATTGT GGCTTCGTTA
CTAATTGATA TTCCTTTTAT TGATGTTTCT AATTTAGAGG TTCAAGAGAG AGAAATAAAA
GTAGCTGAAA TCTTACAGCA AGAAGTAGGA CAACCTTTTG ATTTAGAACA TGGCCCACTA
TTTCGAGCAA AAATAATTAA ATTGCATCCC CAAGAACATT TAGTTACTTT AACTGCCCAT
CATATTATTT GTGATGGTTG GTCTTGGGGG GTGCTGATGC CTGATTTGGG TCAGATATAT
TCTGATTTGC TGCAAGGGAT AATGCCTGAT TTAGAAGAGC CAGACAAATT TAGTGAATAC
GCTATTTTAC AGGAAGAAGA AGAACAAACA TCAGAAGCGA TCGCCACAGA ACAATATTGG
TTAGAGCAAT TTTCCGACTC AGTACCCGTA GTAGACTTTC CTACAGATCG TCCCCGTCCA
CCACTGAGGA CTTTTAACGC CGCCCGTGAA GACTGGCAAT TAAAACCCGA ACTAGTTACC
CACCTCAAAC AACTAGGAAC AAAATTTGGT TGTAGCTTTA TGACAACTAT TTTGGCAGGG
TTTGAAGTGT GGCTGCATCG ATTGACCGGA CAAAATGATC TAGTGGTCGG TATTCCCGCC
GCCGGACAAG CTGCCTCAGG ACAGTATAAT CTTGTAGGGC ATTGCGTAAA TTTATTACCA
CTGCGGACTC AGATAGATAG CCAAAAATCT TTTAGTGACT ACTTACAAAC TAGACGTTCC
ACCATCTTAG ATGCCTACGA TCATCAACAA TTCACCTTTG GTGGCCTAGT CAAAAAACTG
TCACTGCCAC GGGATTCTAG CCGGATTCCT TTAGTGCCAA TTACATTCAA TATTGATCAA
GGATTGGATA GTGATAAATT ACCCTTCGCT GGACTGGAAG TAGAATTTTT CTCTAATCCT
CGCTCATTTG AAAATTTTGA ACTGTTTATC AACGCTACAG AATTGCGTGG TCAACTGACA
CTGGAATGTC AGTACAACAC TAATTTATTT GATGCTGATA CTATCCGCCG TCGCCTGGCA
GAGTTGGAAA CCTTGCTATT GGGTATAGTG TCTAACCCCG ATCAAATTAT TGCTAAGTTG
CCCATCTTAC CAGCCGTTGA GCAACAGTTA TTAGCCGCAT GGAACCAGAC TCAAACTGAC
TATCCCCAAG ATAAATCCAT ACACCAGTTA TTTGCGGAAC AGGTAGAGCG CACCCCCGAC
GCTGTGGCGT TGATATTTCA AGAGCAACAA CTAACTTATC GAGAATTAAA TATTCGAGCC
AATAAATTGG CGCAATATCT GCAAACACTA GGAGTCGGGG CAGACGTTCT GGTGGGTATC
TGCGTGGAGC GTTCCTTAGA AATGGTGGTG GGTCTTTTAG GCATCCTAAA AGCAGGTGGC
GCTTATGTAC CACTAGATCC TGGTTATCCA CCAGAACGCC TGGCTTTCAT GCTGGCAGAC
ACCCAAATTC AATTATTATT GACTCAAAAA CACCTAGTTG AAAAGCTACC TACCCACACT
GCAACATTAA TTTGTTTAGA TACCGATGGG GATACGATCA ACCAAACAAC AGTAGAAAAT
CCCGTTAGTA ACGTCACAGC CAATAACTTG GCTTATGTGA TGTACACATC TGGTTCTACA
GGTCAGCCCA AAGGTGTCAG TGTTATTCAT CAAGGCGTAG TGAGATTAGT CAAAGAAACT
AACTATGTCA GCCTCACCAA TAAAGAAGTA TTTCTGCAAA TTAGCCCCAT CTCTTTCGAC
GCTTCAACTT TTGAAATTTG GGGTTGTTTG CTCAACGGTG GAAAACTAGT AATATTCCCT
CCAAACACAC CATCTTTAGA CGAATTAGAG CAAATTATTC AACAACACCA GGTAACAACT
CTGTGGTTAA CGGCTGGGTT ATTTCATCTG ATAGTCGATG AAAAAATTGA TGCGTTAAAA
CCCTTACGTC AATTATTAGC AGGTGGTGAT GTTTTATCAG TTTCCCACGT CCAGAAATTC
CTTAGCACAG TAGAGAACTG TAAATTAATT AATGGTTATG GACCAACGGA AAACACAACT
TTTACTTGCT GTTATGACAT CACAGCACCA CTAAAACCAG GGGCTTCTAT TCCCATTGGT
CGCCCGATCG CCAATAGCCA AGTTTATATA TTAGACTCTC ATTTACAACC AGTTCCCATT
GGGGTCACAG GTGAACTGTA CATTGGTGGG GATGGTCTAG CGCGAGAATA TTTCCATCGC
CCTGATTTAA CTGCTGAGAG ATTTATTCTT CATCCTTTTA ACTCAGATCC TCAATCACGC
TTATATAAAA GTGGTGATTT GGCACGCTAT TTACCCGATG GCGAGATTGA ATACCTGGGT
CGGATTGATA ATCAGGTAAA AGTCAGTGGT TTCCGCATAG AATTGGGTGA AATTGAAATC
GCCTTTTTAC AACATCCAGC AGTCAAAGAA GCTGTTGTAG TTGTTAGAGA AGATATTCCT
GGAGAGAAAC TCTTAGTTGG TTATTTTGTT GCAGAAACTG GTCAAGATAA CTTGTCAATC
ATCTCGGAAT TACGTCAATT TTTAAAACAA CAACTCCCAG AATATATGGT GCCAAAGGTT
TTTATGGCAC TGGAAGCTTT ACCACTAAAT GCTAACGGCA AAGTTGATCG CCGAGTACTA
CCAAAACCTG ATTCTTCTCG TCCAGAATTA GAAGCAAATT ATGTCGCACC GCGCACTCCC
ATTGAACAAC AAATTGCTGA CATCTGGACG CAGGTTCTCA ATATCAAACG GGTGGGAATT
TACGACAACT TTTTTGAACT GGGTGGATAT TCTCTAGTAG GCATCCAAGT AATATCTAGA
CTACGCCAAG CCCTGCAAGT AGAAATCTTG ATGTCCAACT TATTTGAACT ACCAACCGTG
GCAGATTTAG CAGAACGAGT AGAGACTCTG CGTTGGGCAA CTCAGAACGT TCAAGCTACT
GGAGTTGAAA CCGCAGATGA TTACGAGGAA GGTGAGCTAT GA
 
Protein sequence
MKILPIENQT ELTAIDFDPF ADGELLLTAP ATESQKEIWA SVQMGSAANC AYNESQSLNL 
KGQFNLEAFQ SAIQQLVQRH EALRTTFSTN GSTLCIVASL LIDIPFIDVS NLEVQEREIK
VAEILQQEVG QPFDLEHGPL FRAKIIKLHP QEHLVTLTAH HIICDGWSWG VLMPDLGQIY
SDLLQGIMPD LEEPDKFSEY AILQEEEEQT SEAIATEQYW LEQFSDSVPV VDFPTDRPRP
PLRTFNAARE DWQLKPELVT HLKQLGTKFG CSFMTTILAG FEVWLHRLTG QNDLVVGIPA
AGQAASGQYN LVGHCVNLLP LRTQIDSQKS FSDYLQTRRS TILDAYDHQQ FTFGGLVKKL
SLPRDSSRIP LVPITFNIDQ GLDSDKLPFA GLEVEFFSNP RSFENFELFI NATELRGQLT
LECQYNTNLF DADTIRRRLA ELETLLLGIV SNPDQIIAKL PILPAVEQQL LAAWNQTQTD
YPQDKSIHQL FAEQVERTPD AVALIFQEQQ LTYRELNIRA NKLAQYLQTL GVGADVLVGI
CVERSLEMVV GLLGILKAGG AYVPLDPGYP PERLAFMLAD TQIQLLLTQK HLVEKLPTHT
ATLICLDTDG DTINQTTVEN PVSNVTANNL AYVMYTSGST GQPKGVSVIH QGVVRLVKET
NYVSLTNKEV FLQISPISFD ASTFEIWGCL LNGGKLVIFP PNTPSLDELE QIIQQHQVTT
LWLTAGLFHL IVDEKIDALK PLRQLLAGGD VLSVSHVQKF LSTVENCKLI NGYGPTENTT
FTCCYDITAP LKPGASIPIG RPIANSQVYI LDSHLQPVPI GVTGELYIGG DGLAREYFHR
PDLTAERFIL HPFNSDPQSR LYKSGDLARY LPDGEIEYLG RIDNQVKVSG FRIELGEIEI
AFLQHPAVKE AVVVVREDIP GEKLLVGYFV AETGQDNLSI ISELRQFLKQ QLPEYMVPKV
FMALEALPLN ANGKVDRRVL PKPDSSRPEL EANYVAPRTP IEQQIADIWT QVLNIKRVGI
YDNFFELGGY SLVGIQVISR LRQALQVEIL MSNLFELPTV ADLAERVETL RWATQNVQAT
GVETADDYEE GEL