Gene Ava_4838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4838 
Symbol 
ID3679336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6086577 
End bp6090887 
Gene Length4311 bp 
Protein Length1436 aa 
Translation table11 
GC content41% 
IMG OID637720195 
Productamino acid adenylation 
Protein accessionYP_325330 
Protein GI75911034 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0776089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAT TGGATGAACT CCTATCTGAG CTACGTCAGC GAAATGTGCA GCTTTGGCTG 
GAAGGCGATC GCCTACGCTA TCGGGTAGCA AAAGACAGCC TCACCCCAGA ACTGCTAACA
GAGTTAAAGT CTCAAAAAGC GGAAATCATT GACTTTCTGC GGCGAATTAC CACAGTAGCC
AGTTCTCAAA TTCCTCCAAT CGTATCCTGT GAACGGAACG GTAGCTTACC ACTTTCTTTT
GGTCAACAAA GGTTATGGTT TCTCCATCAG TTTGAACCTG ATAGTTCCTC AAATAATATG
CCAGTGGTGG TGCGTTTTAC AGGGAATCTC AACGTTACCA TCCTCGAAGA AAGCATTAAC
GAAGTCGTCC GTCGTCATGA AGTTTTGCGG ACAACTTTCC CCGCCGTCAA TGGAAAAGCT
ACTCTACTCA TCGCTCCTGA TATTTCTTTA CAATTGCCAA TAATTGATTT GCAGTCAGTA
TCAGATGAAG AACGGGAGGC TGAGGCTTAC CGATTAGCAA CTAATGAAGC TCATCGACCC
TTTGACTTGG CTAACGGCCC CATCTTGCGA GTCTTACTGC TCAGGTTGAG CGAACAAGAG
CATCTGCTAA TTTGGAATAT GCACTGCATA GTTTGTGATG GGGCTTCATC TGATGTGTTT
TATCAGGACT TGACCACCAT CTACAAAGCT TTGGCAGAGG GAAAAGCATC TCCATTATCT
CCTTTACCAC TACAGTATGC TGATTTTGCT CACTGGCAAC ATCAGTGGCT ACAAGGAGAG
GTTTTAGAGT CCCAAGTAAA TTACTGGAAG CAAAAACTAG AGGGCAGTTT ACCTATTATT
CAATTACCTT ACGATCGCCC TCGTCCTGCT GGTGCGCAGA CTTATCGGGG CGATCGCTCC
GCCCTATTGC TACCAAAATC TCTGAATTAT GCCCTAACAG AATTGAGTCA AAAGTGGGGT
GTAACTCTCT TTATGACTCT ACTGACAGTG TTTGAGCTAT TACTCTACCG TTATTCTGGT
CAAGAAGACC TACTTATTAG TTTTGCCAGT GCTGGTCGGG GGCAGGTGGA AACCGAAAGA
CTAATTGGAT TTTTTTCTAA TACTCTTGTC CTGCGGAGTA ACTTGGCAGG TAACCCAACT
TTCCGCCAAT TGTTGGGTCG GGTACGTAAG GATTGCTTAG AAGCTTATGC CCACCAAGAC
CTACCCTTTG AACGATTAAT TGAAGAACTC AGACCAGAAC AACGCAATCG CAGCACCTCA
CCACTATTTC AGGTGAAATT TTCCCTCAAC CCCCCTTGGT CGAATGGTCG TGGCATGGCA
TCGGTGGAGT TGCCTGATTT GAAGATTACT TCCCTGTTTG GTTACATCTA TCATGGCAAG
ACCAAATATG ATTTGACATT GGTGTTACGG GAACAGGATA ACGGACTAGG CATGGTATTT
GACTACAATG CAGATATGTT TGAGTCAAGT ACCATAGAAA GGATGTTGGG ACACTTCCAG
AATTTACTTG AAGGTATTGT AGCTAATCCA GATTGTCCGA TTTCCGAATT AACTCTGTTA
ACACAGCCAG AGCAGCATCA GCTTTTAGTT GAGTGGAATC ATCGCCAAAC TGCGGATATT
AAAGATATTT GCATTCATCA ATTATTTGAG GCTCAAGTCC GACGCACTCC CCATAATATT
GCTGTAATTG AGGATAATCA ACAACTAAAT TATCAAGAAC TCAACGAACG TGCCAACCAA
TTAGCTCATT ATTTGCAGAC ATTAGGTGTA GGTGCAGGGA TATGTGTTGG TTTATATTTA
GAGCCATCTC TGGAAATGAT TGTCGGATTG CTAGGTATTT GCAAAGCCGG AGGAACATAC
ATACCAATCA CCCCAACATC CCACCCCAAT GATCTAGCTT TCATCTTAAA TGATGCTCAT
GTATCTTTAT TGTTAACTAA AAAGTCCTGG TCTGAGAAAC TACCTGAGTG TGAATCCAGC
ATTATTTGTT TGGATAGTGA TGAAGAAGTG ATCGCCCCCC ACAGTCGGCA AAATTTAGTC
ACTCAGGTAA CATCTGGGAA TCTCGCCTGT GTAATATATG CACCCAATCC GATAAATAAA
CCCGACGGGA TCGCCATGAG CCACAGCAAT TTAGTTAATC ATGCTGTAGC TATTCATCAA
CTTTGGGAAG TGAGCGCAGG CGATCGCATC CTAGTGTTTT CTGGTATCAG TAGCGATACA
ACTATCGAAT CACTATTTCC CTGTTGGATG AATGGTGCTA GTGCAGTTAT TCAGCCCCAA
ACCACCCAAA ACTCAATCAC AAATTTCTTT TCGTTCATCG CCCAACAGCA GATTACAGTT
CTCAATTTAC CTACTTTCTT TTGGTATAAA ATACTTAAAG AAATATCAAC CTCTCAAGCA
CCTTTACTTG AAAGCTTACG CTTGGTAATG GTTGGCGGCG AAAAAGTCTC ACGTACTGCT
TATGAAAGTT GGATAGAACT GGTCGGGAAA CAAACACGCT GGCTCAATGC CTATGGCTCA
ATCGCCACAA CTTTCACTGC CACAGTTTAC GATCCACAAA CAGCTAGTAG TGAAACAGAA
ATTCTCATCG GTCAACCCAT AGCCAATACT CAAATATACA TTCTCGATCA ACTATTGCAA
CCTGTACCCG TTGGCGCTCC TGGAGAAGTC TACATCAGTG GTGTTGGTGT CGCTAAAGGC
TACTTCAGAC GCACTGATTT GACATCTGAG AGATTTATTC CTCATCCCTT CAGTGACAAT
ACTCATGAAA GATTGTATAA AACTGGAGAT TTAGCCCGCT ACCTACCAGA TGGCAACATT
GAATATTTGG GACGCACTGA CAACCAAGTT AAAATATGTG GTGTCTGTGT TGATTTAGAA
CAAATAGAAG CTCTACTTCA TCAACATCAA GCCATAACTC AAGCCGTGGT TATTGCTACT
GAAGTAACTT CTGGCGAAAA GCAGCTAGTA GCTTACCTCG TCACTCAACC AGAACAAACT
CCTACAATTG ATGATTTACA AACTTTTCTC TCACAAAAAA TTCCCCATTA CTGGATTCCT
TCAGACTTTA TTTTCCTAGA ATCTCTACCT GTCAATACCA ACGGGCAAGT CAATCGTGGC
GCTCTCCCAG AGCCTAATTT CATCAAACAA AAATCAGGAG CTAATTTTGT AGCTCCTCGC
AATCAGCTAG AAATAGAGTT AACGAAAATT TGGGAAAATG TTTTAGGTAA ACATCCAATA
GGATTAAAAG ACAACTTTTT CGGGTTGGGA GGACATTCAC TTTTAGCACT ACGAATGTTT
TCTCAGATTG AAACTATTTT TGGTAAAAAT CTCCCCCTAG CTATCCTCTT TCAAGCACCG
ACAATTCAGC AATTATCTGA CATTTTGCAA CAGGAAGGAT GCTCAACTTC ATGGTCTTCA
CTTGTTCCTA TCCAACTCAA TGGATCTAAG CCACCTTTAT TTCTAATTCA TCCAATTGGT
GGTAACGTTT TAGAGTATTC TACCCTCACC CATTATTTAG GTGAAGAACA ACCCATTTAC
GGACTACAGT CCTTGGGATT AGATGGTAAG CAAGCTCCTC TGAACCGAGT TGAAGACATG
GCTAACGCCT ATCTTCAAGA AATACGTAGC ATTCAACCTA ATGGGCCATA TTTCATTGCA
GGTTATTCCT TTGGTGGATT AGTAGCCTAT GAAATGGCAC AACAACTATA CACTCAAGGA
CAAAAAATAG GACTCTTAGC CTTATTAGAT ACAAGCTGTC CTAACTTACA AATAACTCGT
CCATCATTGA ATAAATTTGT CCGAGTTCAT GTAAATAATC TTTGGAAACT TCAACCCCAA
GAGAAGTTGA TTTATATCCG CAATTGGTTG AAGTGGTATT TAAAAAAGAA AAATACCAGA
GATGTTTTAA TTCAAGATTG GCAAGAAGTT TTGGAAAACT CTCATATTGT CAATGTTATA
GATGCTAACG TACAAGCTTA TGAAAATTAC ACAGCTCAAC CGTATGCAGG AGCAATGACT
TTATTTCGCT CTAGCATACA GCCAGTAAAA TTGTCAGACA ATATTGATTT GGGATGGCAT
GATTTAGTCA CGGGAGGTTT AGAGATTCAT CATCTTACTG GAGATCACAG CAGGTTACTC
AAAGAACCCC ACGTCCAAAA ATTGACAAAA CAATTGAAAA TCTGTTTAGA GCGATCGCTA
ACAGACAATA CTTTGCTTTT TGATGATTTA AATAAATTTC AACCAAGGCT GATTTCTCAT
GTCGGGACAA CGCAAACTAG TAGTCAATTC TCTATCAATG TTAGTAAATA A
 
Protein sequence
MKTLDELLSE LRQRNVQLWL EGDRLRYRVA KDSLTPELLT ELKSQKAEII DFLRRITTVA 
SSQIPPIVSC ERNGSLPLSF GQQRLWFLHQ FEPDSSSNNM PVVVRFTGNL NVTILEESIN
EVVRRHEVLR TTFPAVNGKA TLLIAPDISL QLPIIDLQSV SDEEREAEAY RLATNEAHRP
FDLANGPILR VLLLRLSEQE HLLIWNMHCI VCDGASSDVF YQDLTTIYKA LAEGKASPLS
PLPLQYADFA HWQHQWLQGE VLESQVNYWK QKLEGSLPII QLPYDRPRPA GAQTYRGDRS
ALLLPKSLNY ALTELSQKWG VTLFMTLLTV FELLLYRYSG QEDLLISFAS AGRGQVETER
LIGFFSNTLV LRSNLAGNPT FRQLLGRVRK DCLEAYAHQD LPFERLIEEL RPEQRNRSTS
PLFQVKFSLN PPWSNGRGMA SVELPDLKIT SLFGYIYHGK TKYDLTLVLR EQDNGLGMVF
DYNADMFESS TIERMLGHFQ NLLEGIVANP DCPISELTLL TQPEQHQLLV EWNHRQTADI
KDICIHQLFE AQVRRTPHNI AVIEDNQQLN YQELNERANQ LAHYLQTLGV GAGICVGLYL
EPSLEMIVGL LGICKAGGTY IPITPTSHPN DLAFILNDAH VSLLLTKKSW SEKLPECESS
IICLDSDEEV IAPHSRQNLV TQVTSGNLAC VIYAPNPINK PDGIAMSHSN LVNHAVAIHQ
LWEVSAGDRI LVFSGISSDT TIESLFPCWM NGASAVIQPQ TTQNSITNFF SFIAQQQITV
LNLPTFFWYK ILKEISTSQA PLLESLRLVM VGGEKVSRTA YESWIELVGK QTRWLNAYGS
IATTFTATVY DPQTASSETE ILIGQPIANT QIYILDQLLQ PVPVGAPGEV YISGVGVAKG
YFRRTDLTSE RFIPHPFSDN THERLYKTGD LARYLPDGNI EYLGRTDNQV KICGVCVDLE
QIEALLHQHQ AITQAVVIAT EVTSGEKQLV AYLVTQPEQT PTIDDLQTFL SQKIPHYWIP
SDFIFLESLP VNTNGQVNRG ALPEPNFIKQ KSGANFVAPR NQLEIELTKI WENVLGKHPI
GLKDNFFGLG GHSLLALRMF SQIETIFGKN LPLAILFQAP TIQQLSDILQ QEGCSTSWSS
LVPIQLNGSK PPLFLIHPIG GNVLEYSTLT HYLGEEQPIY GLQSLGLDGK QAPLNRVEDM
ANAYLQEIRS IQPNGPYFIA GYSFGGLVAY EMAQQLYTQG QKIGLLALLD TSCPNLQITR
PSLNKFVRVH VNNLWKLQPQ EKLIYIRNWL KWYLKKKNTR DVLIQDWQEV LENSHIVNVI
DANVQAYENY TAQPYAGAMT LFRSSIQPVK LSDNIDLGWH DLVTGGLEIH HLTGDHSRLL
KEPHVQKLTK QLKICLERSL TDNTLLFDDL NKFQPRLISH VGTTQTSSQF SINVSK