Gene Ava_4664 details


Gene Information

Locus tag: Ava_4664
Symbol:
ID: 3679820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5825240
End bp: 5827996
Gene Length: 2757 bp
Protein Length: 918 aa
Translation table: 11
GC content: 48%
IMG OID: 637720020
Product: GCN5-related N-acetyltransferase
Protein accession: YP_325156
Protein GI: 75910860
COG category: [C] Energy production and conversion; [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1042] Acyl-CoA synthetase (NDP forming); [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins
TIGRFAM ID: [TIGR02717] acetyl coenzyme A synthetase (ADP forming), alpha domain
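
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are illustrative and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumed to be included in the CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = span_length(5825240, 5827996)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 2757 918
```

Under these assumptions the computed values match the listed Gene Length (2757 bp) and Protein Length (918 aa).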


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAAAT CTCTCAAGCC AAGCAGTGAC CGTACCTCTG ATATCTTACA GGCAGAGAAG 
TTAAACCCCC TTGATGCCAT CTTTGCGCCC CAAAGTGTCG CCATTATTGG TGCTAGCGAA
AAGGTGGGTA GTGTAGGACG CACCATCTTA TGGAACTTGA TTAGTAACCC CTTTGGCGGT
ACTGTTTTCC CAGTCAATCC TAAACGTCAT AGTGTTTTGG GAATTAAAGC CTACCCTTCA
ATTGCGTCTA TTCCCGAAAC CGTAGATTTG GCAATTATCG CTACCCCAGC ACCCACAGTA
CCGGGGATTA TTTCTGAGTG TGTGGATGCG GGTGTGCAAG GGGCAATTAT TATTTCTGCT
GGCTTTAAAG AAGCTGGTGC TGAGGGTATA GCTTTAGAAA GGCAGATTTT AGCAGAGGCG
CGGCGCGGTA ACATTCGCAT CATTGGCCCC AACTGCTTGG GTGTGATGAG TCCCCGTACT
GGCCTAAATG CTACCTTCGC CAGTTCAATG GCGCGTTCTG GAAATGTGGG TTTTCTCAGT
CAAAGTGGCG CACTGTGTAC CGCCATCCTG GATTGGAGTG TGCGGGAAAA TGTCGGTTTT
AGTGCCTTTG TCTCCATTGG TTCCATGCTA GATGTGGGTT GGGGCGACCT GATTTATTAT
CTTGGTGATG ACCCCCAGAC TAAAAGTATA GTCATATACA TGGAATCAAT TGGTGATGCG
CGGTCTTTTA TCTCCGCAGC ACGAGAAGTT GCACTCACCA AACCAATAAT TGTGATTAAA
GCCGGACGTA CAGAAGCAGC CGCCAAAGCA GCCGCCTCTC ATACTGGTGC ATTGGCGGGG
AGTGATGCTG TGTTAGATGC AGCTTTCCGG CGTTGTGGGG TATTGCGGGT AAATAGTATT
TCTGATTTGT TCGATATGGC GGAAGTACTA GCCAAACAAC CCCGCCCCAA AGGCCCAAGA
CTGACGATTT TGACTAATGC TGGCGGGCCT GGAGTCCTGG CCACAGATGC GTTAATTGAA
ACTGGCGGCG AAATTGCCCC CATTTCCCCA GAAACAATTA CCTCCCTCGA CCAAATCCTA
CCTACACACT GGAGTCATGC TAACCCAATT GATATTCTCG GTGATGCTGA CCCCCAACGC
TACACTCAAG CTTTAGAAAT CGCTGCTAAA GACCCCAACA GCGATGGTTT ATTGGTGATT
CTCACTCCCC AAGCCATGAC AGACCCCACC CAAACGGCGG AACAGTTGAA ACCCTACGCC
CAAATTGCTG GTAAACCCAT TTTGGCAAGT TGGATGGGGG GTGCAGATGT GGCTACGGGA
GAAGTTATTC TCAATCGTCA ACGTATCCCC ACTTATGCTT ATCCAGATAC GGCGGCGCGG
GTGTTTAGTT ATATGTGGCA ATCTAGCTAC AATCTGCGTG GTATCTACGA AACGCCTGTA
TTACCTGTGG ATGCTGCATC TGGTTTACCA GACCGTCATT TAGTAGAGAA TATTATCTCT
ACAGCCCGTC AGGCAAAACG GACAATTTTA ACTGAAGATG AATCCAAGCA GATTTTAGCA
GCCTATGGCA TCCCGATTGT GGCTACTTGT GTGGCTAAAA CTGAGGATGA GGCGATTAAA
TGTGCTGAGA GTATTGGTTA TCCCGTCGTT GTCAAACTGT ATTCCCACAC AATTACCCAT
AAAACTGATG TGGGTGGTGT GCAGTTAAAC CTCCCCGATG CAGACGCAGT ACGCCGCGCT
TATCGGATGA TTGCTGCATC GGTGGAACAG AAAGTGGGTA GTGAACATTT CTTGGGTGTA
ACTGTCCAGC CAATGGTAAA AATGGATGGC TACGAATTGA TTATTGGGAG TAGCCTTGAC
CCCCAATTTG GGCCAGTGTT GTTATTTGGT GCTGGTGGAC AATTAGTGGA AGTATTTCAA
GATCGAGCGA TCGCCCTTCC TCCCCTTAAT AGTACCTTAG CTCGGCGCAT GATGGAACAC
ACCAAGATTT ACAAAGCCCT TAAAGGTGTG CGGGGAAGGC AAAGTGTCGA TATGGAAGGA
CTAGAACAAC TAATGGTGGC GTTTAGTCGG TTGGTAGTCG AACAGCGTTG GATTAAGGAA
ATTGATATTA ATCCCTTGCT GGCTTCACCT GTGCAGGAGA ATGGGGAAAA CTCCTCACTG
ATTGCCCTAG ATGCACGGGT TGTTTTGCAT GAACCAGATG TGACAGAAGA CCAACTACCA
AAGTTAGCAA TTCGACCTTA CCCCACCCAA TATGTGGAAC AGTGGACAAT GAAAGACGGG
ACTCCCGTAA CCATCCGTCC AATTCGTCCC GAAGATGAGC CGTTGTTAGT ACAATTCCAC
AAGACACTTT CCGAGGAAAG CGTTTACTTT CGTTACTTCC ACCTGATGAA ATTGAGTCAT
CGCATCACCC ATGAACGACT CACCCGCATC TGCTTTATTG ACTATGACCG AGAAATGGCT
TTGGTTATAG AGTCTCAAGG GGAAATTTTG GCAGTTGGGA GGTTAAGTAA ACTGCATGGG
ACAAAGACAG CCGAGTTTGC TATGTTAGTA AGCGATCGCT ATCAATGTCA GGGTTTAGGT
GCAGAGTTAC TGCGGCGCTT GCTGCAAATT GGACGCGATG AGCAAATCGA ACGCATCACA
GCCGATATCC TAGCTGATAA TTATGGGATG CAGCGAGTGT GCGAAAAGCT AGGTTTTAAG
CTAGAACGCA CCGCCGAAGC AAGTGTTATG AAAGCGGAAT TGGTTATTGG TCATTAG
 
Protein sequence
MQKSLKPSSD RTSDILQAEK LNPLDAIFAP QSVAIIGASE KVGSVGRTIL WNLISNPFGG 
TVFPVNPKRH SVLGIKAYPS IASIPETVDL AIIATPAPTV PGIISECVDA GVQGAIIISA
GFKEAGAEGI ALERQILAEA RRGNIRIIGP NCLGVMSPRT GLNATFASSM ARSGNVGFLS
QSGALCTAIL DWSVRENVGF SAFVSIGSML DVGWGDLIYY LGDDPQTKSI VIYMESIGDA
RSFISAAREV ALTKPIIVIK AGRTEAAAKA AASHTGALAG SDAVLDAAFR RCGVLRVNSI
SDLFDMAEVL AKQPRPKGPR LTILTNAGGP GVLATDALIE TGGEIAPISP ETITSLDQIL
PTHWSHANPI DILGDADPQR YTQALEIAAK DPNSDGLLVI LTPQAMTDPT QTAEQLKPYA
QIAGKPILAS WMGGADVATG EVILNRQRIP TYAYPDTAAR VFSYMWQSSY NLRGIYETPV
LPVDAASGLP DRHLVENIIS TARQAKRTIL TEDESKQILA AYGIPIVATC VAKTEDEAIK
CAESIGYPVV VKLYSHTITH KTDVGGVQLN LPDADAVRRA YRMIAASVEQ KVGSEHFLGV
TVQPMVKMDG YELIIGSSLD PQFGPVLLFG AGGQLVEVFQ DRAIALPPLN STLARRMMEH
TKIYKALKGV RGRQSVDMEG LEQLMVAFSR LVVEQRWIKE IDINPLLASP VQENGENSSL
IALDARVVLH EPDVTEDQLP KLAIRPYPTQ YVEQWTMKDG TPVTIRPIRP EDEPLLVQFH
KTLSEESVYF RYFHLMKLSH RITHERLTRI CFIDYDREMA LVIESQGEIL AVGRLSKLHG
TKTAEFAMLV SDRYQCQGLG AELLRRLLQI GRDEQIERIT ADILADNYGM QRVCEKLGFK
LERTAEASVM KAELVIGH
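
Both sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch of how the gene sequence relates to the protein sequence, the code below strips the layout whitespace and translates codon by codon. It uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which start codons it permits); alternative start codons are not handled here. The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in TCAG order; shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCAGAAAT CTCTCAAGCC AAGCAGTGAC"))  # MQKSLKPSSD
```

The result matches the first block of the protein sequence, and the final codons of the gene (`GGT CAT TAG`) translate to the terminal `GH` followed by a stop.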