Gene Ava_4833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4833 
Symbol 
ID3679409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6074194 
End bp6076077 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content40% 
IMG OID637720190 
Productamino acid adenylation 
Protein accessionYP_325325 
Protein GI75911029 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.954107 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0358179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAC AGTTAAGGAA ATATCCCCAC AACCAGTGCA TTCATCAGTT ATTTGAAGAA 
CAAGTAGAAC GCACACCTGA TGCAGTGGCT GTTGTTTTTG GTAAACAACA TTTGACTTAC
CAACAATTAA ATCACCGAGC CAATCAATTA GCGCAATATC TGCGAACCTT GGGTATAGGA
GCAGAAATGC TAGTAGGAAT TTGTCTAGAG CGATCGCCAG AAATGATCAT TGGATTGTTA
GCAATCCTCA AAGCTGGGGG AGCTTATGTA CCTTTAGATG CAGGATATCC ACAAGAACGC
CTAGCTTTCA TGCTAGTAGA TACCCAAATC CCAGTATTAT TAACCCAAAA AGAATTAGTC
AAAAAATTAC CTAATCATGA GGCGCGCGTA ATTTGCCTTG ATACTGATTG GGAAATTATC
AATCAACACA CACCAGAAAA CCAAAATATT AGCATCACAC CTGATAATTT GGCTTATGTC
ATGTATACCT CAGGTTCTAC AGGACAACCC AAAGGTGTGA GTGTTGTTCA TCGTGGTGTA
GTCCGCTTAG TCAAACAAAC TAACTACGCT AACTTTACTA ATACAGAAAT ATTTTTACAA
TTTGCGCCCA TATCTTTTGA TGCTTCCACC TTTGAAATTT GGGGTTGCTT ACTCAACGGT
GGAAAACTCG TTTTATATCC CAGTAACACC CCATCTATAG ATGAATTAGG ACAAGTTATT
CAAAAATATC AAATCACTAC CATCTGGTTA ACAGCAGGCT TATTTCATCT CATGGTAGAT
GAAAATATTC ATGCTTTAAA ACCCTTACGT CAACTGTTAG CAGGTGGTGA TGTTTTATCC
GTTTCTCACG TCCAAAAATT TCTAAAAACA GTAGAAAATT GTCAACTAAT TAACGGTTAC
GGCCCCACAG AAAACACCAC CTTTACCTGT TGTTATCACA TCAAAGACCC AGTCAGACCA
GATAGCTCAA TTCCTATTGG TCGCCCCATC GCTCATACCC AGGTATACAT ATTAGATGAA
AATTTGCAAC CAGTAGCAAT GGGAGCAACA GGAGAATTAT ATATTGGTGG CGACGGCTTG
GCACGTGGTT ATCTCCATCG TCCAGAATTA ACCAAAGAAA GATTTATTGA ATTAAATAAC
TCAAACTTTC AATCCCTAAC TCTATATAAA ACAGGGGATT TAGCTCGTTA TTTACCAGAT
GGCAATATTG AATTTCTGGG ACGAATTGAC AACCAAGTGA AAATTCGAGG CTTCCGCATT
GAGTTAGGAG AAATTGAGCG AGAGATTTCT CAATATCCTG ATGTGCGAGA AAACGTCGTT
TTGGCTCATC AGACGGCAAC AGGCGAAAAG AGATTGGTAG CTTATATTGT GCTACATCAA
AGCAGTTCAT ATAAACAAGA ACAATTACGT AATTTCCTCC AGCAGCGATT ACCAGACTAT
ATGTTGCCAT CGGCATTTAT GGTTTTAGAA TCATTGCCGT TAACTGCTAA TGGCAAAGTA
GATAGACATA AACTTCCCAC TCCCAGCAAA GAACGTCCCC AACTGGAACA AGTATATATT
GCACCCCAAA CTGATTTACA ACGGCAGTTA ACCAATATTT GGTCTGATGT TTTGAATATT
GAGCCAGTGG GTATTGATGA CAACTTCTTT GATTTGGGTG CAACTTCGAC CTTAATTATG
CAGATAGCTG TGCGAGTACA ACAACAACTA GGAATTGAGC TATCGGTGGT GAAACTGTTT
CAGTATCCTA CAATCGCTGG CTTGGAAAAA TATTTAAATG TAGAGCAGAA CACTCAACAA
TCCTATAACC AGCTGCAAAG TCGCGCTCAA CGCCAGCAAG CAGCAGCTTC TGCTCGTCGT
CGTCATAGTC AACGGGGGGT TTAA
 
Protein sequence
MEKQLRKYPH NQCIHQLFEE QVERTPDAVA VVFGKQHLTY QQLNHRANQL AQYLRTLGIG 
AEMLVGICLE RSPEMIIGLL AILKAGGAYV PLDAGYPQER LAFMLVDTQI PVLLTQKELV
KKLPNHEARV ICLDTDWEII NQHTPENQNI SITPDNLAYV MYTSGSTGQP KGVSVVHRGV
VRLVKQTNYA NFTNTEIFLQ FAPISFDAST FEIWGCLLNG GKLVLYPSNT PSIDELGQVI
QKYQITTIWL TAGLFHLMVD ENIHALKPLR QLLAGGDVLS VSHVQKFLKT VENCQLINGY
GPTENTTFTC CYHIKDPVRP DSSIPIGRPI AHTQVYILDE NLQPVAMGAT GELYIGGDGL
ARGYLHRPEL TKERFIELNN SNFQSLTLYK TGDLARYLPD GNIEFLGRID NQVKIRGFRI
ELGEIEREIS QYPDVRENVV LAHQTATGEK RLVAYIVLHQ SSSYKQEQLR NFLQQRLPDY
MLPSAFMVLE SLPLTANGKV DRHKLPTPSK ERPQLEQVYI APQTDLQRQL TNIWSDVLNI
EPVGIDDNFF DLGATSTLIM QIAVRVQQQL GIELSVVKLF QYPTIAGLEK YLNVEQNTQQ
SYNQLQSRAQ RQQAAASARR RHSQRGV