Gene Ava_4746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4746 
Symbol 
ID3679633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5959995 
End bp5962946 
Gene Length2952 bp 
Protein Length983 aa 
Translation table11 
GC content42% 
IMG OID637720102 
Productamino acid adenylation 
Protein accessionYP_325238 
Protein GI75910942 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTCA CTCTAGAAAT ACCCTATTAT CAAATTATTC AGCAAGCGCA ATTAACACCG 
GATGCGATCG CTGTTTTTGA TGAAACCAAT CCCCTGACTT ATCAACAGTT AGATCGTCTC
TCGAATCAAG TGGCTACTTA TTTACGCACC CAAGGAGTCG CCCCCAATAC GAGAGTCGGG
ATTATGGCTG AACGGGGAGC TAGAATGATG ATTGGCATTT TAGGCATTCT CAAAGCTGGC
GGGAGTTATA TCCCCCTTGA CCCTGGTTAT CCCACTGACC GACTGCGTTA CATCCTAGAA
CACGCAACTA TCCAGACTTT ACTTACAGAA CATCAAGTTA GTCCACAATT GGTGGCTTGT
GTTCAGGAGG CGCTACCTTT ACAAACGGTG ATGTTTTTAG ATGAGGGTGA ACCGTTAGAG
GAAATTCAAT CATTAAATCA AATTACTGCG GAAACTTGGC AAGGCTTGAG TAAGGACACT
TTAGAACTGT GTAACAGTCC TGACGATTTA ATGGTGATTC TCTACACCTC TGGCTCAACG
GGTCGTCCCA AAGGGGTAAT GCTGAATCAT CGAGGTTACA TGAATCGTCT GACATGGATG
CAGAATACTT TTTCATTACA ACCAGGCGAT CGCGTAGCCC AAAGAACCTC TTTTTGCTTT
GATATCTCGG TTTGGGAAAT TTTCTGGACA TTGATGTCAG GGGCGACTAT TTGTCCTGTG
CAACGGGAAG TAGTTCTCAA TCCTTGGGAA TTTGCTCAGT GGATTCAACA AACTCAAATT
AATGTGATGC ACTTTGTTCC TTCCCTGTTT GGGGAGTTTA TTAGTGCTTT AGAAAATGAA
ACTTGGAGTT TTCCCCAATT ACGTTGGTTG ATGTTTAGTG GGGAAGCATT ACCTATGTCC
TTTATCCAGA GGTGGATTGA CCGTCATGGA CTCAAAACCG GGCTAGCCAA TCTTTACGGC
CCTACAGAAG CTTCCATTGA CGTTACTTGT CACTTAATTA CAGAACGTCC TGATGAACGG
GTGACGACGC AAATTCCCAT TGGTAAAGCT ATCGATAATG TGTATTTAAG AGTTCTGGAC
GAGGGGATGC AGCCTGTACA ACCAGGAAAC ACGGGAGAAC TCTGGCTTGG TGGAGTCCAA
TTGGCATTAG GCTATCTCAA AGATCCAGAG AAAACGGCTC AGGCTTTTTG TCCTAACCCC
TTTACTGATA TTCCTGGCGA TTATATTTAT CGGACAGGAG ATTTGGTCAA AGAACTGTCA
GACGGAACCA TTGAATATCA TGGACGCATA GATCATCAAG TTAAAATTCG CGGTTTCCGC
ATTGAATTGG GAGAAATCGA AAGTGTGTTA ACTACTCATC CCGATGTCCG GGAAGCCGCA
GTTCTAGCCG TTGATTATGG GGAAGGACAA AAGCGATTAG TTGCTTGTAT ATCGGGAAAT
AAAATCAAAA ATCGTTTCCT CAAGGAATAT CTAGAACAGA AACTTCCTCA CTACATGATT
CCCCAAAGGT TTCTGTGGCT GGATAGTCTA CCCAAAAATC ATAATGGGAA ATTAGACCGC
AAAACATTAG TGGCTCAATT AACCTCTGAT TCCTCTCCTT CCTCTGATTC CTCTCCTTCT
TTGCTACCTT TATTACCTCT AGGGCCAGCT CAAAGTTGGT TAGTCAAATA TTTTGAGCCT
CCTTATCAGT GGTTGGGTTA TACCCGTTTT CTCTATCATC AATCCCTTAA TGTAGATATC
TTCAATCAAG CGGTTAATTT CCTTATTCAT CGTCATGACG CTTTCCGCAC GGTTTTTGTG
CAGAACCAGG GACAATGGCA ACAACATATT CTCAATCAAA CGCCATCCTT TGTTTCTGAA
TATTGGGATG GTAGTCAGGT AAGTCCAGAA GAACGGAATC TGAAAATTGA GCAATACATG
GAAGTGGTAG CTAAACAAAT TCAGCTTGAC CGTTGGCCAA TGACGGCAGC GATCGTCGTC
AAGGTGAGTC CTACTTGTTA CGATATTACG ATGATTGCTC ACCATATGAT TACTGATATG
TTGAGCGCGA CAATTTTATT TAAAGAACTG TGGAGTGCTT ATCATCAACT TTTATTGGGT
GTCTCTCCCA CATTTCCCCA GGCTAAATCC GAATCCTATG AAGATTTTAT TCAATTATTA
GTGAATGAGG ATAATAAAGG AACCTTTGAA CATCATATTA AATATTGGCA ATCTCAATTT
CCTGATACTA CAAGCAGATT TGAGATTCCT ATCGACCATG TGAAAGGAGC CAATATTGAA
GCCTCGGCGG AAAAAGAATG TTTTCGTCTT AGTCAACAAC AGACAGAACT TTTATTAAAT
CAAGGAAAAT CTTATTATGG CTCTAATGTT TACTTGATTC TCCTCGCGCC TTTATATCGT
TTAATGGCTA ATTGGAGTCA TCGCCCAGGG GTGGTAATTA GTCATCGTAG TCATGGCAGA
GATTTAGGAC AAGGTAGTAC TTTTATGGAA AGTATGGGGA ATTATGCAAT TAATTTCCCT
CTAGGTATAA CTTTGGACTC ATCTGCGATT TGGCAAAAAA TTATTCAAGA GATTAGCGAT
CGCTTTCAAA GTTTACCTAT GAATGGTGTA ACCTATGATT GGATAGGCGA TCGCTTACCT
GAATACCTCT ACCCTGACAG CAATTTAACT TCTATACGGG TCAATTATTT AGGGAATCGT
TCTGCTCCTC CGTCTAACCT CTTTGAGTTT ATACAGACTG TCCGAGACTG TCGTTTGTCA
CTCCCCGATC AAAAACGTAC CACACTCATT GAATTTTTTC TTTTGGTAGC TGATGGTCAT
TTAGAGATCA CAATTGAATA CTCTCGTAAT TTTCATCTCG CTACAACAAT TCAATCCTTA
GGAAAACAAT ATCTCCTATT GCTTGAGGAT ATCTTGACTC CTGTCTCTAT GTCTCAGCCT
CCCCTGCGCT AA
 
Protein sequence
MTFTLEIPYY QIIQQAQLTP DAIAVFDETN PLTYQQLDRL SNQVATYLRT QGVAPNTRVG 
IMAERGARMM IGILGILKAG GSYIPLDPGY PTDRLRYILE HATIQTLLTE HQVSPQLVAC
VQEALPLQTV MFLDEGEPLE EIQSLNQITA ETWQGLSKDT LELCNSPDDL MVILYTSGST
GRPKGVMLNH RGYMNRLTWM QNTFSLQPGD RVAQRTSFCF DISVWEIFWT LMSGATICPV
QREVVLNPWE FAQWIQQTQI NVMHFVPSLF GEFISALENE TWSFPQLRWL MFSGEALPMS
FIQRWIDRHG LKTGLANLYG PTEASIDVTC HLITERPDER VTTQIPIGKA IDNVYLRVLD
EGMQPVQPGN TGELWLGGVQ LALGYLKDPE KTAQAFCPNP FTDIPGDYIY RTGDLVKELS
DGTIEYHGRI DHQVKIRGFR IELGEIESVL TTHPDVREAA VLAVDYGEGQ KRLVACISGN
KIKNRFLKEY LEQKLPHYMI PQRFLWLDSL PKNHNGKLDR KTLVAQLTSD SSPSSDSSPS
LLPLLPLGPA QSWLVKYFEP PYQWLGYTRF LYHQSLNVDI FNQAVNFLIH RHDAFRTVFV
QNQGQWQQHI LNQTPSFVSE YWDGSQVSPE ERNLKIEQYM EVVAKQIQLD RWPMTAAIVV
KVSPTCYDIT MIAHHMITDM LSATILFKEL WSAYHQLLLG VSPTFPQAKS ESYEDFIQLL
VNEDNKGTFE HHIKYWQSQF PDTTSRFEIP IDHVKGANIE ASAEKECFRL SQQQTELLLN
QGKSYYGSNV YLILLAPLYR LMANWSHRPG VVISHRSHGR DLGQGSTFME SMGNYAINFP
LGITLDSSAI WQKIIQEISD RFQSLPMNGV TYDWIGDRLP EYLYPDSNLT SIRVNYLGNR
SAPPSNLFEF IQTVRDCRLS LPDQKRTTLI EFFLLVADGH LEITIEYSRN FHLATTIQSL
GKQYLLLLED ILTPVSMSQP PLR