Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4664 |
Symbol | |
ID | 3679820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5825240 |
End bp | 5827996 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637720020 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_325156 |
Protein GI | 75910860 |
COG category | [C] Energy production and conversion [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins |
TIGRFAM ID | [TIGR02717] acetyl coenzyme A synthetase (ADP forming), alpha domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAAAT CTCTCAAGCC AAGCAGTGAC CGTACCTCTG ATATCTTACA GGCAGAGAAG TTAAACCCCC TTGATGCCAT CTTTGCGCCC CAAAGTGTCG CCATTATTGG TGCTAGCGAA AAGGTGGGTA GTGTAGGACG CACCATCTTA TGGAACTTGA TTAGTAACCC CTTTGGCGGT ACTGTTTTCC CAGTCAATCC TAAACGTCAT AGTGTTTTGG GAATTAAAGC CTACCCTTCA ATTGCGTCTA TTCCCGAAAC CGTAGATTTG GCAATTATCG CTACCCCAGC ACCCACAGTA CCGGGGATTA TTTCTGAGTG TGTGGATGCG GGTGTGCAAG GGGCAATTAT TATTTCTGCT GGCTTTAAAG AAGCTGGTGC TGAGGGTATA GCTTTAGAAA GGCAGATTTT AGCAGAGGCG CGGCGCGGTA ACATTCGCAT CATTGGCCCC AACTGCTTGG GTGTGATGAG TCCCCGTACT GGCCTAAATG CTACCTTCGC CAGTTCAATG GCGCGTTCTG GAAATGTGGG TTTTCTCAGT CAAAGTGGCG CACTGTGTAC CGCCATCCTG GATTGGAGTG TGCGGGAAAA TGTCGGTTTT AGTGCCTTTG TCTCCATTGG TTCCATGCTA GATGTGGGTT GGGGCGACCT GATTTATTAT CTTGGTGATG ACCCCCAGAC TAAAAGTATA GTCATATACA TGGAATCAAT TGGTGATGCG CGGTCTTTTA TCTCCGCAGC ACGAGAAGTT GCACTCACCA AACCAATAAT TGTGATTAAA GCCGGACGTA CAGAAGCAGC CGCCAAAGCA GCCGCCTCTC ATACTGGTGC ATTGGCGGGG AGTGATGCTG TGTTAGATGC AGCTTTCCGG CGTTGTGGGG TATTGCGGGT AAATAGTATT TCTGATTTGT TCGATATGGC GGAAGTACTA GCCAAACAAC CCCGCCCCAA AGGCCCAAGA CTGACGATTT TGACTAATGC TGGCGGGCCT GGAGTCCTGG CCACAGATGC GTTAATTGAA ACTGGCGGCG AAATTGCCCC CATTTCCCCA GAAACAATTA CCTCCCTCGA CCAAATCCTA CCTACACACT GGAGTCATGC TAACCCAATT GATATTCTCG GTGATGCTGA CCCCCAACGC TACACTCAAG CTTTAGAAAT CGCTGCTAAA GACCCCAACA GCGATGGTTT ATTGGTGATT CTCACTCCCC AAGCCATGAC AGACCCCACC CAAACGGCGG AACAGTTGAA ACCCTACGCC CAAATTGCTG GTAAACCCAT TTTGGCAAGT TGGATGGGGG GTGCAGATGT GGCTACGGGA GAAGTTATTC TCAATCGTCA ACGTATCCCC ACTTATGCTT ATCCAGATAC GGCGGCGCGG GTGTTTAGTT ATATGTGGCA ATCTAGCTAC AATCTGCGTG GTATCTACGA AACGCCTGTA TTACCTGTGG ATGCTGCATC TGGTTTACCA GACCGTCATT TAGTAGAGAA TATTATCTCT ACAGCCCGTC AGGCAAAACG GACAATTTTA ACTGAAGATG AATCCAAGCA GATTTTAGCA GCCTATGGCA TCCCGATTGT GGCTACTTGT GTGGCTAAAA CTGAGGATGA GGCGATTAAA TGTGCTGAGA GTATTGGTTA TCCCGTCGTT GTCAAACTGT ATTCCCACAC AATTACCCAT AAAACTGATG TGGGTGGTGT GCAGTTAAAC CTCCCCGATG CAGACGCAGT ACGCCGCGCT TATCGGATGA TTGCTGCATC GGTGGAACAG AAAGTGGGTA GTGAACATTT CTTGGGTGTA ACTGTCCAGC CAATGGTAAA AATGGATGGC TACGAATTGA TTATTGGGAG TAGCCTTGAC CCCCAATTTG GGCCAGTGTT GTTATTTGGT GCTGGTGGAC AATTAGTGGA AGTATTTCAA GATCGAGCGA TCGCCCTTCC TCCCCTTAAT AGTACCTTAG CTCGGCGCAT GATGGAACAC ACCAAGATTT ACAAAGCCCT TAAAGGTGTG CGGGGAAGGC AAAGTGTCGA TATGGAAGGA CTAGAACAAC TAATGGTGGC GTTTAGTCGG TTGGTAGTCG AACAGCGTTG GATTAAGGAA ATTGATATTA ATCCCTTGCT GGCTTCACCT GTGCAGGAGA ATGGGGAAAA CTCCTCACTG ATTGCCCTAG ATGCACGGGT TGTTTTGCAT GAACCAGATG TGACAGAAGA CCAACTACCA AAGTTAGCAA TTCGACCTTA CCCCACCCAA TATGTGGAAC AGTGGACAAT GAAAGACGGG ACTCCCGTAA CCATCCGTCC AATTCGTCCC GAAGATGAGC CGTTGTTAGT ACAATTCCAC AAGACACTTT CCGAGGAAAG CGTTTACTTT CGTTACTTCC ACCTGATGAA ATTGAGTCAT CGCATCACCC ATGAACGACT CACCCGCATC TGCTTTATTG ACTATGACCG AGAAATGGCT TTGGTTATAG AGTCTCAAGG GGAAATTTTG GCAGTTGGGA GGTTAAGTAA ACTGCATGGG ACAAAGACAG CCGAGTTTGC TATGTTAGTA AGCGATCGCT ATCAATGTCA GGGTTTAGGT GCAGAGTTAC TGCGGCGCTT GCTGCAAATT GGACGCGATG AGCAAATCGA ACGCATCACA GCCGATATCC TAGCTGATAA TTATGGGATG CAGCGAGTGT GCGAAAAGCT AGGTTTTAAG CTAGAACGCA CCGCCGAAGC AAGTGTTATG AAAGCGGAAT TGGTTATTGG TCATTAG
|
Protein sequence | MQKSLKPSSD RTSDILQAEK LNPLDAIFAP QSVAIIGASE KVGSVGRTIL WNLISNPFGG TVFPVNPKRH SVLGIKAYPS IASIPETVDL AIIATPAPTV PGIISECVDA GVQGAIIISA GFKEAGAEGI ALERQILAEA RRGNIRIIGP NCLGVMSPRT GLNATFASSM ARSGNVGFLS QSGALCTAIL DWSVRENVGF SAFVSIGSML DVGWGDLIYY LGDDPQTKSI VIYMESIGDA RSFISAAREV ALTKPIIVIK AGRTEAAAKA AASHTGALAG SDAVLDAAFR RCGVLRVNSI SDLFDMAEVL AKQPRPKGPR LTILTNAGGP GVLATDALIE TGGEIAPISP ETITSLDQIL PTHWSHANPI DILGDADPQR YTQALEIAAK DPNSDGLLVI LTPQAMTDPT QTAEQLKPYA QIAGKPILAS WMGGADVATG EVILNRQRIP TYAYPDTAAR VFSYMWQSSY NLRGIYETPV LPVDAASGLP DRHLVENIIS TARQAKRTIL TEDESKQILA AYGIPIVATC VAKTEDEAIK CAESIGYPVV VKLYSHTITH KTDVGGVQLN LPDADAVRRA YRMIAASVEQ KVGSEHFLGV TVQPMVKMDG YELIIGSSLD PQFGPVLLFG AGGQLVEVFQ DRAIALPPLN STLARRMMEH TKIYKALKGV RGRQSVDMEG LEQLMVAFSR LVVEQRWIKE IDINPLLASP VQENGENSSL IALDARVVLH EPDVTEDQLP KLAIRPYPTQ YVEQWTMKDG TPVTIRPIRP EDEPLLVQFH KTLSEESVYF RYFHLMKLSH RITHERLTRI CFIDYDREMA LVIESQGEIL AVGRLSKLHG TKTAEFAMLV SDRYQCQGLG AELLRRLLQI GRDEQIERIT ADILADNYGM QRVCEKLGFK LERTAEASVM KAELVIGH
|
| |