Gene Ava_4101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4101 
Symbol 
ID3681566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5100210 
End bp5104247 
Gene Length4038 bp 
Protein Length1345 aa 
Translation table11 
GC content41% 
IMG OID637719449 
Productamino acid adenylation 
Protein accessionYP_324597 
Protein GI75910301 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.357734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.611275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACACTG AAATGACTAA TCTGATTCGT ATTTCTCCTC AACAAAAGCA TTTATGGTTA 
CAACAAAAAG ATAATTATCA AGCTTACTGT ACACAAATTG CTATTAATAT CACAGGGAAT
CTGAACTCAG ATATTCTGAA ATTAGCTTTA GCAAATCTTA GCGATCGCTA TGAAATATTT
CACACTGCTT TTCATTCTTT ACCAGGGATG AATATCCCAT TCCAATCAAT TAGCGATGCT
CCAACTTACA ACCATATTTC CCTAGCAGAA TATGATTGGA GTCGTTTAAG TTCTCAGGTA
CAAAAAGCCA AAATCGATAC ACTTTTTCAA GATACCAGCC TCCGAGTTTT TGATTTTGAA
CAAGCGACAA ATTTACATTT TATCTTAGGG AAGCTAGCGA CAGATCAGCA TATACTGTTT
ATCAGCTTGC CTGCTTTATA TGCAGATAAG ACAACACTCA ACTATCTTAT ATGTGAAATC
GGTCGTGCTT ATACAGCTTG TTTAGATGGC GAAGAAATTT CTGAGGAAGT AATGCAATAT
GCAGATGTCT CTGAATGGCT GAATGAATTA TTAGAATCTG AGGATACAGA AACAGCTAGA
AATTATTGGC AACAACAAGA TATTTCTTCT GTGTTAAATT TGCGAATTCC TTTTGAAATC
AAAAATGATC ATCAGTCATC CGGAGAAATT AGCTTTGCAC CTCAGTCGCT GTGTTTAGCG
ATGAACCGCG ATACAGTAGA AAAACTAGAA ATTCTCCAAG AAAGTTATCA TACTTCAACA
GAAGCTATTT TATTAGCTTG TTGGCAAGTG CTACTGTGGC GGTTAAGCGG TCAGTCGGAA
ATTGTGATTG GGGCGGCTGG CGATGGACGC GAATATGATG AATTAAAAGG AGTTTTTGGA
CTTTTAACTA AATATTTACC GATTCACTCC CGCTTAGAGG ATGCTTTAAA ATTCTCAGAA
CTGATGGAAA GAGTTTCTGA GTCTTTGCGG TTTGTAAAAA AATGGCAAGA GTATTTTAAC
TGGGAGAATT TCATCTCTAC TAATGAGAAA TTTGCTGCTC GCCCCTTTTT CCCATTCTGC
TTTGATTTTA CACAACAGTC ACAGAGCTAT GTAAATGGAA ATATTGCCTT TTCGGAATAT
AAAAAATCTG CCAACATCGA CAAATTCCAA ATTAAGCTAT CTAGCGGTTA TAGCCAAGAT
AATTTGGTGG TGAATTTTGA TTACGATGCT AACTTATTTT CGGAGGATAG TATCAAAATT
CTCGCGGAAC AATATCTAAC ATTAATCGCA AGTGTCACTT ATAACCCAGA AGCAGCAATC
ATTGATTTAA ATATGATCAG TGAAAGCGAT CGCCAGAAAT TATTGATTGA TTTTAACAAT
ACAAAAATAG AGTTTCCCCA CGATAAATGT CTTCATGAAC TATTTGCCGC ACAAGTAGAA
CGTACCCCCA ACAACATCGC CGTTGAATTC AATCACGAAT CCCTCACTTA TGCCCAACTA
AACGCCAAAT CCAACCAGCT AGCGCATCAT CTCCAAAAAT TAGGAGTAAA ACCAGAAGTA
TTGGTGGGAA TTTGTGTAGA ACGTTCCTTA GATATGCTGA TTGGTATCCT GGGTATTCTC
AAAGCTGGTG GTGCATATAT ACCATTCGAT CCCACCTATC CCCAAGAACG ACTGGGATTT
ATGTTAGAAG ATGCTCAAAT TCCCATCCTG TTAACGCAAC AAAGACTGGT AGATAAATTT
GTTGAACATA AAACCCAGAT AATTTGTCTG GATCGAGACT TACCAGAAAA TGCCACCCTC
AGCATTGACA ACCCTGTCAG CAATGTTACA TCAGAAAATT TAGCTTATAT CATTTACACC
TCTGGTTCTA CAGGAAAACC CAAGGGAACG ATGATTCCCC ACAGAGGCTT AGTTAACTAC
CTGAGTTGGT GTACAAATGC TTATGCGTTA GCCCAAGGTT ATGGTGCGCC AGTACAATCT
TCCATCGGGT TTGACGCGAC AATTACCAGC TTATTCTCAC CGTTATTAGT CGGAAAAAGG
GTTGTATTGT TACCTGAGAA GGAAGAAATT GAAGCCTTGT CAGCACTTTT ACAGTCTGAT
CAAAACTATA GCTTGGTGAA AATTACACCA GCGCACCTGG AAATGCTCAA TCAAATGTTA
CCTAACCATA AAGGTGTCAC GGAAACAAGA GCATTAATCA TCGGTGGTGA AGCATTATTG
GGTAAAAGCT TGAACTTTTG GCGCGATAAT GCCCCAAAGA CCAGAATCAT TAACGAATAT
GGCCCCACCG AAACTGTTGT TGGTTGTTGT GTTTATGAGG TTGATGAGCA AACTTCTTTA
TCAGGTGCAA TTTTAATTGG TCGCCCCATT GCAAATACTC AACTTTATTT GCTGGATGAC
AAGCAAAAAC TTGTACCAAT CGGTGTTCCC GGCGAATTAT ATATTGGTGG TGCAGGAGTA
GCTAGGGGCT ATCTCAATCG TCCAGAATTA ACACAACAGC GATTTATCCC TAATCCTTTT
AGCGATGAAC CTAATTCTCG CCTATACAAG ACAGGAGATT TAGCGCGATA TTTACCTGAT
GGTAATATTG AGTACCTCGG ACGCATTGAC CATCAAGTGA AAATTCGTGG TTTCCGTATT
GAATTAGAAG AAATTGAGTC CTTACTAGCC CAACATCCCC TAGTAAATGC TGTCACTGTA
ATTGCGAGAG AAGACCAACC TGGAGATAAG CGGTTGGTAG GCTATATTGT ACCGAAAGAG
CAAGCCCCCA CCAGTAGCGA ACTGCGCCAA TTTTTACAGT CCAAGTTACC CGAATACATG
ATACCCTCTG CCTTTGTGAT GCTAGAGGTC ATCCCTCTCA CCACTCATGG TAAGGTAGAC
CGCCAAGCCC TACCGCAACC CGATACATCC CGTCTAGATA CACTCAAAGT AGAATTTATC
GCACCACGCG ATCGCCTAGA ACTAAAATTA GCCAATATTT GGGAAAATCT ACTCAATGTC
CATCCTATAG GTATCAAAGA TAGCTTTTTT GAACTCGGTG GTCACTCTCT GCTGGCTGTG
CGCTTGATGG CTCATATTAA CCAAGAGTTT GGTAAAAACT TAGCCTTAGC CACACTTTTC
CAAAACCCCA CAATCGAAAA ATTGGCTAAT CTGCTTCGCC AAACATCAGA AATCTCATCT
TGGTCTCCCT TAGTAGAAAT TCAAACAGGT AATTCCCAGT ATCCTTTCTT CTGCTTACCT
GGAGGCGGTG GTAATGTCCT TTATTTACAT GAGTTGGCGC GTGATTTAGG TTCTGAGCAA
ACATTTTACG GTTTACAAGC GCCAGGTTTA AATGGAGAAT CAGACCCCTT GACTAGTGTT
GAAGAGATGG CGGCTTATTA CATCCAGGCT ATACAAAGCG TGCAGCCGGA AGGGCCATAT
TTTCTCGGTG GTCACTCCTT TGGGGGTATT GTGGCCTATG AAATAGCTCA ACAGTTAGTC
AAATCCGGAC ATGAGGTGGC TTTAGTGGCG ATTTTAGACG CTCCAGCCCC AGTTGCTAGT
GACAAACCTA TATATATTGA TGTTGATGAT GCAACGCGAT TGACCGAAAC TGCTCGCCTG
ATTGAACGTT GGGCAGGTAA AAGCTTAAAT ATCAGCTATG AAATACTCCA GCCACTAGAA
CTCGACGCAC AATTGGAATA TCTCAAAGAA CAATTAATTG CCGTTGGTTT ACTCCCGACT
GGTACTGAGA CAAAGCAAGT GCGTGGGTTG GTGCAAGTGT TCGAGACCAA TCTGCAAGCT
AGTATCAAAT ATTCACCACA GGAAGTTTAT CCCCATCGCC TGACACTGCT GCGGGCTAGG
GAAGTGAACG CAGAGGATGC GGCACTATTA ACTGAGTTGC GCCAAGATCC CGCCTGGGGA
TGGGGTCAGT TTTCGACTGA AAAAGTCGAT ATTCACATTG TTCCAGGCGA TCATATGACG
ATGATGACTC AACCCCACAT CTCATCGGTT GCCAAACAAC TAAGAATCTG CATTGAGCAA
ACAAATGTAG GTTCATGA
 
Protein sequence
MHTEMTNLIR ISPQQKHLWL QQKDNYQAYC TQIAINITGN LNSDILKLAL ANLSDRYEIF 
HTAFHSLPGM NIPFQSISDA PTYNHISLAE YDWSRLSSQV QKAKIDTLFQ DTSLRVFDFE
QATNLHFILG KLATDQHILF ISLPALYADK TTLNYLICEI GRAYTACLDG EEISEEVMQY
ADVSEWLNEL LESEDTETAR NYWQQQDISS VLNLRIPFEI KNDHQSSGEI SFAPQSLCLA
MNRDTVEKLE ILQESYHTST EAILLACWQV LLWRLSGQSE IVIGAAGDGR EYDELKGVFG
LLTKYLPIHS RLEDALKFSE LMERVSESLR FVKKWQEYFN WENFISTNEK FAARPFFPFC
FDFTQQSQSY VNGNIAFSEY KKSANIDKFQ IKLSSGYSQD NLVVNFDYDA NLFSEDSIKI
LAEQYLTLIA SVTYNPEAAI IDLNMISESD RQKLLIDFNN TKIEFPHDKC LHELFAAQVE
RTPNNIAVEF NHESLTYAQL NAKSNQLAHH LQKLGVKPEV LVGICVERSL DMLIGILGIL
KAGGAYIPFD PTYPQERLGF MLEDAQIPIL LTQQRLVDKF VEHKTQIICL DRDLPENATL
SIDNPVSNVT SENLAYIIYT SGSTGKPKGT MIPHRGLVNY LSWCTNAYAL AQGYGAPVQS
SIGFDATITS LFSPLLVGKR VVLLPEKEEI EALSALLQSD QNYSLVKITP AHLEMLNQML
PNHKGVTETR ALIIGGEALL GKSLNFWRDN APKTRIINEY GPTETVVGCC VYEVDEQTSL
SGAILIGRPI ANTQLYLLDD KQKLVPIGVP GELYIGGAGV ARGYLNRPEL TQQRFIPNPF
SDEPNSRLYK TGDLARYLPD GNIEYLGRID HQVKIRGFRI ELEEIESLLA QHPLVNAVTV
IAREDQPGDK RLVGYIVPKE QAPTSSELRQ FLQSKLPEYM IPSAFVMLEV IPLTTHGKVD
RQALPQPDTS RLDTLKVEFI APRDRLELKL ANIWENLLNV HPIGIKDSFF ELGGHSLLAV
RLMAHINQEF GKNLALATLF QNPTIEKLAN LLRQTSEISS WSPLVEIQTG NSQYPFFCLP
GGGGNVLYLH ELARDLGSEQ TFYGLQAPGL NGESDPLTSV EEMAAYYIQA IQSVQPEGPY
FLGGHSFGGI VAYEIAQQLV KSGHEVALVA ILDAPAPVAS DKPIYIDVDD ATRLTETARL
IERWAGKSLN ISYEILQPLE LDAQLEYLKE QLIAVGLLPT GTETKQVRGL VQVFETNLQA
SIKYSPQEVY PHRLTLLRAR EVNAEDAALL TELRQDPAWG WGQFSTEKVD IHIVPGDHMT
MMTQPHISSV AKQLRICIEQ TNVGS