Gene Ava_1610 details


Gene Information

Locus tag: Ava_1610
Symbol:
ID: 3679167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1991901
End bp: 1995329
Gene Length: 3429 bp
Protein Length: 1142 aa
Translation table: 11
GC content: 43%
IMG OID: 637716950
Product: amino acid adenylation
Protein accession: YP_322128
Protein GI: 75907832
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
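
The start, end, gene length, and protein length fields above agree with one another if the coordinates are read as 1-based and inclusive and the stop codon is not counted in the protein length. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with illustrative variable names:

```python
# Values copied from the gene information fields above.
start_bp = 1991901
end_bp = 1995329
gene_length_bp = 3429
protein_length_aa = 1142

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codon_count = gene_length_bp // 3

print(span_bp == gene_length_bp)             # True
print(gene_length_bp % 3 == 0)               # True
print(codon_count - 1 == protein_length_aa)  # True
```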


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0752074
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAA TGACAGTATT TGCATTTTTA TCTTCGTTAA ATAGGTTAGG TATAAAAGTT 
TGGATTGAAG ACAAGCAACT ACGTTATCGC GCACCCAAGG GAGTAATGAC TCCAAACATC
AAGCAGGATT TGATGGAGCA AAAGAACGAA ATTTTGAACT GTCTTCAACA ATCAATAAAT
ACAAAAAAAC TAGCTTTTGA ACCTATTTTA CCTACTGAGC GCAATCACCA TTTACCTTTA
TCTTTTAGTC AAGAAAGGAT GTGGTTTTTA CATCAGTTGG AAAGTGGAAG TGGTGCTTAT
ACAATAGCTT TTGCTGTGCG CCTAGAGGGA AATCTCAATA TCAAAGCTTT GGAACAAGCT
ATCGGGGAGA TAGTGCAGCG TCATGAGGTT TTGCGTACTC GCTTTGAAAT CAAAAATAAT
AAGCCAGTAC AAGTAATAGA CCCCAAAATA ACCTTAGCAT TGCCAGTAGT GGATCTGAAA
AATGTGGTAG ATCCCTGGCA ACAAGTCGAG GAACTGGCAA TAAAAGAAGC CTGCAAAGCA
TTTGATTTAG CGAATGATTC TGTGTTGCGG GTGATGCTCT GGACAGTTTC TCAAAATGAT
TATGCACTAT TGTTTGCTAT TCACCATATT GCTGCTGATG GTTGGTCAAA AGGTGTTTTC
ATAAACGAAC TTTCTGCTTA TTATCGAGCT ATTGCCAAGG GGGATTCTGT GGTATTGCCA
GAGTTACCTG TGCAGTATGC AGACTATAGC TTATGGCAAC GCCACCACCT GACAAATCAG
ACACTAGAGC ATCAATTAAG CTACTGGAAA CAGCAGTTAG CAGGAGCGTC ACCTGTATTA
GAACTACCTA CAGATCATCC CCGCCCGGTT ATACAGACTT TTCGAGGGGG TATAGAACGA
TTTCAAATAG ATGGTAAGCT GACGCAACAA CTCCAGAAAC TCAGCCAAGG TTCAGGAAGC
ACGTTGTTTA TGACGCTGCT GGCGGGTTTT GTTGTGTTAA TGTCTCGCTA CAGTGGGCAA
AGGGATCTGG TTGTTGGTTC TCCAATTGCC AATCGCAACC GTCAAGAAAT TGAAGGGTTA
ATTGGACTTT TTGTCAATAC TTTGGCATTG AGATTTGATC TGTCCCCAGA ACAGACCTTT
AATACCTTAC TAGAGCAGGT AAAACAGGTT ACTCAAGACG CTTATGAGAA TCAGGATTTG
CCCTTCGAGA TGTTAATCGA AGAGTTACAC CTTGAGCGAA GCCTAGATCG TAGCCCACTG
GTGCAAGTGA TGTTTGCGCT TCAGAACGCT CCGAAAAATT CTTGGGATCT ACCTAATTTG
AAGGTCGAGG AAATGCCTTG GGAGCTTGAT GCAGTGCGGT TTGACTTAGA AGTTCATTTC
TGGGAAGTTC CCCAAGGTCT TGAGGGGATT TGTTATTACA GCAGCGATTT ATTTGATGGG
GCAACGATCG CCCGCATGAT GAAACATTTC CAGAATTTGT TGGCAAATAT TGTCACTAAC
CCACAACAAT CAGTCAACCA ATTACCCCTA CTCACAGCAC CAGAAAAACA GCAATTATTA
ATAGAGTGGA ACAACACTGA TACTGATTAT CCCCGTAATC AATGTATCCA TCATTTATTT
GCTGCCCAAG TCCAAAAAAC TCCTGATGCG ATCGCTGTAG TATATGGAGA ACAACAACTC
ACATATCACC AGTTAAATAC ACAAGCAAAT CAATTAGCAC ATTATTTGCA AAAACTAGGT
GTGAAACCAG GTGTGTTGGT GGGTATTTGC GTTGAACGTT CTGTCTCTAT GATTGTTGGG
TTGTTGGCAA TTCTCAAGGC AGGCGGGGCT TATGTGCCTT TGGATACAGA ATATCCTCAA
GAGCGTTTGG CTTTCATCAT CGAAGACACA CAGCTATCGG TGCTATTGAC AACACAGAAA
ATAGCTGAAA CTCTGCCCCA AGATCAAGGG CGTGTTGTCT GCTTTGATAC TGATATAGAA
GCGATCGCTC TAGAGAGTCA GCAAAACCCC ACGGTAGAAG TCACAGCCGA TCATCTTGCT
TATGTCATCT ATACTTCAGG CTCAACAGGG ACACCCAAAG GAGTTGTTGT TGACCACAAA
GCAGTGAACC GCTTGGTGAT CAACACAAAC TATATCAACA TCAAACCCAC AGATGTCATT
GCTCAAGCTG CAAATTGCAC CTTTGACGCT GCAACCTTTG AAATTTGGGG AGCCTTGCTT
AACGGAGCGC GGTTGTTAGG AGTGAGAAAA GATTTGGCAC TTTCCCCCAA ACAGTTTGCA
ACTTTTATGC GATCGCAAGA TATCAGCGTA TTATTCTTAA CAACTGCCTT GTTCAATCAA
ATCGCTCAAG CAGTCCCCTC TGCTTTCAAT TCACTGCGGT ATCTTTTATT CGGCGGTGAG
GCTGTTGATG TTAAATGGGT CAGAGAAGTA CTAAACAATG GTGCGCCCCA GCAACTACTC
CACGTTTATG GGCCAACAGA GAATACCACA TTTACTTCCT GGTACTTAGT ACAGGATGTT
CCCGAAGACG CTACAACCAT TCCCATCGGG CGACCAATTG CTAACACACA AATTTACTTG
CTAGATTCCC AACTGCAACC AGTGGGCGTT GGTGTACCAG GAGAAATTTA CATTGGGGGT
GATGGTTTAG CCAGAGAATA TCTCAACCGA CCAGAGTTAA CACAACAGAA ATTTATTCCT
AACGCCTTTA GTTCTGATTC CCATTCATGT CTCTACAAAA CGGGAGATAA AGCGCGTTAC
CTGAGTGATG GCAATATTGA ATTTCTTGGT CGGATAGATC ATCAGGTGAA GATTCGCGGC
TTTCGTATCG AATTGGGAGA AATCGAAACC GTTTTGAGTC AACACCCCTT ATTAAAAGAA
AGTGTTGTGG TAGTAAGAGA GGACTCCCCT GGAGACAAAC GTCTAGTAGC TTATTTGGTT
CCAGCTGTTA ATGACTACAC TCATGACAAC CAGAAGCTAG TGCCGCAAGT ACGCGAATAT
ATCCAACAAA AGCTACCGAA TTACATGGTG CCACAAGCTT TTGTTCTCCT CCATGCCTTA
CCCTTGACAC CCAATGGCAA GGTAGACCGT CGCGCCCTAC CACAACCTGA TATAGCCACT
AGAAATCTCT CAACTGGCAC TGTTTTACCC CGGACTCCCA TTGAAGCTCA ACTGGCACAA
ATCTGGAGTG AAGTCTTAGG CGTGGAAACC ATTGGCGTTA AAGACAACTT CTTTGAGCTT
GGGGGTCATT CCCTACTAGC TACCCAAGTC CTGTCACAGA TCAACTCAAC CTTTGGATTG
GATTTATCTA TCCAGATCAT GTTTGAGTCT CCCACCGTAG CTGGGATAGC AGCCTATATA
GAAGTAGTGA ATTTAGTCAC ACAAAATTTA TCAAATAAAG AAGTCAGTAG CGAGGTAGTG
GAGTTTTAA
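
The GC content listed in the gene information can in principle be recomputed from the gene sequence above. A minimal Python sketch follows, assuming the sequence is supplied in the same 10-base blocks used here. `gc_fraction` is a hypothetical helper name, and only the first line of the sequence is used as input to keep the example short, so the printed value is not the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAATAAAA TGACAGTATT TGCATTTTTA TCTTCGTTAA ATAGGTTAGG TATAAAAGTT"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence instead of `first_line` would give the value to compare against the reported GC content.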
 
Protein sequence
MNKMTVFAFL SSLNRLGIKV WIEDKQLRYR APKGVMTPNI KQDLMEQKNE ILNCLQQSIN 
TKKLAFEPIL PTERNHHLPL SFSQERMWFL HQLESGSGAY TIAFAVRLEG NLNIKALEQA
IGEIVQRHEV LRTRFEIKNN KPVQVIDPKI TLALPVVDLK NVVDPWQQVE ELAIKEACKA
FDLANDSVLR VMLWTVSQND YALLFAIHHI AADGWSKGVF INELSAYYRA IAKGDSVVLP
ELPVQYADYS LWQRHHLTNQ TLEHQLSYWK QQLAGASPVL ELPTDHPRPV IQTFRGGIER
FQIDGKLTQQ LQKLSQGSGS TLFMTLLAGF VVLMSRYSGQ RDLVVGSPIA NRNRQEIEGL
IGLFVNTLAL RFDLSPEQTF NTLLEQVKQV TQDAYENQDL PFEMLIEELH LERSLDRSPL
VQVMFALQNA PKNSWDLPNL KVEEMPWELD AVRFDLEVHF WEVPQGLEGI CYYSSDLFDG
ATIARMMKHF QNLLANIVTN PQQSVNQLPL LTAPEKQQLL IEWNNTDTDY PRNQCIHHLF
AAQVQKTPDA IAVVYGEQQL TYHQLNTQAN QLAHYLQKLG VKPGVLVGIC VERSVSMIVG
LLAILKAGGA YVPLDTEYPQ ERLAFIIEDT QLSVLLTTQK IAETLPQDQG RVVCFDTDIE
AIALESQQNP TVEVTADHLA YVIYTSGSTG TPKGVVVDHK AVNRLVINTN YINIKPTDVI
AQAANCTFDA ATFEIWGALL NGARLLGVRK DLALSPKQFA TFMRSQDISV LFLTTALFNQ
IAQAVPSAFN SLRYLLFGGE AVDVKWVREV LNNGAPQQLL HVYGPTENTT FTSWYLVQDV
PEDATTIPIG RPIANTQIYL LDSQLQPVGV GVPGEIYIGG DGLAREYLNR PELTQQKFIP
NAFSSDSHSC LYKTGDKARY LSDGNIEFLG RIDHQVKIRG FRIELGEIET VLSQHPLLKE
SVVVVREDSP GDKRLVAYLV PAVNDYTHDN QKLVPQVREY IQQKLPNYMV PQAFVLLHAL
PLTPNGKVDR RALPQPDIAT RNLSTGTVLP RTPIEAQLAQ IWSEVLGVET IGVKDNFFEL
GGHSLLATQV LSQINSTFGL DLSIQIMFES PTVAGIAAYI EVVNLVTQNL SNKEVSSEVV
EF
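
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal Python illustration, not the database's own pipeline. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in which start codons are permitted). `translate` is a hypothetical helper name, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace between 10-base blocks is ignored; a trailing partial codon
    is dropped.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGAATAAAA TGACAGTATT TGCATTTTTA TCTTCGTTAA ATAGGTTAGG TATAAAAGTT"
last_line = "GAGTTTTAA"

print(translate(first_line))  # MNKMTVFAFLSSLNRLGIKV
print(translate(last_line))   # EF
```

The two outputs match the first 20 residues and the final two residues of the protein sequence listed above.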