Gene Information |
Locus tag | Ava_2039 |
Symbol | |
ID | 3680980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 2521347 |
End bp | 2524355 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637717384 |
Product | hypothetical protein |
Protein accession | YP_322556 |
Protein GI | 75908260 |
COG category | [C] Energy production and conversion |
COG ID | [COG3202] ATP/ADP translocase |
TIGRFAM ID | |
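The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Hypothetical helpers; the numeric values are copied from the record above.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(2521347, 2524355)
print(length, protein_length_aa(length))  # 3009 1002
```

Both results match the Gene Length (3009 bp) and Protein Length (1002 aa) fields.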
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.373695 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAATTGA AAAATTACTC GTTGGCGGCA AATCCAAGTT GGCTGGCACA ACTTTTAAAA TGGGTGAATC TACGACCAGA GGAAGGCGAA CGGACTTGGA TGATGTTTGC CTTTTACACG ACTGTATCTG TGGGGTTGCG ATGGGCTGAG GATAGTACAG TAGCACTATT TTTGGATGAA TATGGCGCTG GGCCGTTGCC TTGGATGTAT ATTGCCAGTG CGGTTATGGG TATGGCGCTG GTTTTTGTCT ATTCTTGGTT GCAAAAGATT TTTCCTTTGC GCTGGGTGGT GGTGGCGATC GCACCTTGTA TGGTCGTGCC ATTAATCTTG TTAGTATTAT TACGCTGGGG CATCCATCTT TCTTACTTTT CAGTCATTGT TGTATTTTTA CTACGGTTGT GGGTAGATTC CCTTTATGTA GTTAATGATC TAAATACTTC CATTGTTGCT AACCAATTAT TTAATATTCG AGAAATTAAG CGCACTTACC CACTGATTAG TAGCGGATTA TTAGTGGCTG ATGTGATCAG TGGCTTTAGT TTGCCTTGGT TGCTGGAGTT CGCCAAACTC AACCGCGTGA TTATGATCGC CTGTGGAGTG ATTATGATTG GCTCGGCAAT TCTCTGCTAT TTAAGTTATC AGTATCGCAG TTCTTTTCCC GAAGCACCAC AGCGCCTAAT TCCTCAAGAA CAAGCCTCTC GGCATCGGCG TATCCAAGCA CCACTCAAGC GTTATACTTT ACAGCTATTT GCCTTTGTTG GGCTGTTACA AATTATTGGT TTATTAGTTG ATTTTCAATA TCTCCAAGAA CTAAAAATCA ACTTAGGCGA TCGCGAACTA GCAGGCTTTT TAGGTATATT TGGTGGGATT GTTGGCTTGT GTGAATTAGG AACTCAGTGG TTCGTTTCCA GCCGCTTGAT CGAACGCTTT GGAGTGTTCT TTACTGCTGC ACTTTTGCCT GTGGCTGTAG GCTTTGTCGT TCCTGGGATG ATTGTTGTAC TTTACTTACT ACCAGGAATA CAATCACTGG CATTTTTCTG GGGATTAGTC GGGTTGAAGT TCTTTGATGA ACTGCTACGC TACACCTTCG TTATTAGTAG TGGGCCGTCA CTATATCAAC CGATACCAGA ACGCATTCGT AGCCGGATGC AGGCATTATC AGGAGGAACG GCGGAAGCGA TCGCCACTGG TACTGCTGGT ATCATCATTG TGATTACCTT GTTTGTGTGT GGGTTATTTG TACCTGCAAC AATGCAAAAG TGGGTGTTTA TCTGCGAAAC AATGGTAGTA GCCGTTGCCT GCTTGAAGGT AGTATGGTTA TTGCGATCGC GTTATGTTGA TTTGTTAGTC TTGAGCGCCG ATAGAGGCGA ACTAAGTGCC TCTAATGTGG GTTTACGTGC CTTTAAACAA GGGGTAGTCA AAGCTTTAGA AGAAAAGGGC AACACCGCAG ATAAACACTC ATGTATTGAA TTATTAGCCC AAATTGACCC CCAAGGGGCA GCAGATGTTC TCTCACCAAT TTTATTTAAG CTAACGCCAG ATTTACAACG CCATAGTTTA GAGGTGATGC TAGGGGCTGA TGCTAATCCC ACCCATGTAT CGGAAGTAAA ATGGTTGTTA GAGCGTCATC CAGACAATGT TAATCCCGAA GTTTTTGCCC TAGCATTACG TTATGTTTGG CTAGCTGACC CCAATCCCAA TTTAAGCCAA CTGGAAGAAT ACCTCAACCA GCGCCACCAC TCATTAACTC GCGCCACAGC TGCGGCTTTG CTATTGCGTC AGGGAACACC AATCCAAAAA 
GTAGCCGCCA CCAAGACTTT AAGCCGGATG CTAACCCATA AGCAGGAACG AGAACGGGTT AATGGTGTCA AAGCCCTCCG AGAAGTGGTT TATTTACAGA CGTTACGGAT TCATATTCCT AATTTGTTAC AGGATGAATC CTTAAGGGTG CGTTGTGCTG TCTTAGAAAT GATTGCTGCA ACACGCATGG AAGAATACTA TTCCTCACTG CTGACAGCAC TTTATTACAA ATCAACCCGT GCTACCGCCA TGCGTGCCTT AATCAAAATG GAGAATGAAG CATTGGATAT GCTGTTACAA CTGGCTACCA ATGCCCATAA GCCAGAAGTA TTAAGGATGT ATGCTTGGCG TACCATAGGA CAAATTTCTA CTACAGAAGC TGTAGAGACT TTATGGTTGC AGTTGGAATC TTCTTGGGGT GCTACCAGGT ACCATATTCT GCGTAGCTTG TTGAAAATTC AGAAACAATC AGAAATTACC AATTTAGTAG ATAGGTTTCA ACACAGTCGG GTAGAAAGTC TCATTGATCA GGAATTACGA TTTTTGGGTG AGATTTATGC AGCTTATATA GACTTACGGC CACAATTAAA TTTAGATAAT AATCAAACAA ATTCCAGAGC TTTGATTGTT GCGGATTTAC TCCAACGTGC CTTGGCAGAA ATGGAATGGG ATATTCGGGA ACGTTTATTA TTATTATTAA AATTACTGTA TTCACCAGAT AAAATGCAGG CAGCCGCTTT TAATTTGCGG TCTGAATCGA TGGTGAATTT AGCACGAGGA TTGGAAATAT TAGAACATAC AGTAAATTTG CCCAGAAAAT CATTGTTGTT AAATCTCTTA GATAAGCGAT CGCACCAAGA AAAGCTGCAT GATCTTTTAG AGGCAGAATT CACAGAATAT CAACCGTTGT CAGCTAGTGA AAGATTGCGG AAGATGCTGA CTTTAGGCAC TTTTCTCTCT GATTGGTGCT TGGCTTGCTG TTTTCATTTT GCCCAGGTTA ACCGGATTCG TTTGACAATT AACGAAATTC TGATGGCTTT GCGTCATCCT ACGGGCTTTG TACGGGAAGC GGCGATCGCT TATTTGAGTG TGGTTTCACA TCGCGTCCTC TTAGAACTTC TGCCCAAATT AAAAAAAGAT TCTCATCCCC TAGTAGCAGC ACAAATTCAG GAATTGATCA AAAAATCCGC CATCAAAATT AATCAATAA
Protein sequence | MELKNYSLAA NPSWLAQLLK WVNLRPEEGE RTWMMFAFYT TVSVGLRWAE DSTVALFLDE YGAGPLPWMY IASAVMGMAL VFVYSWLQKI FPLRWVVVAI APCMVVPLIL LVLLRWGIHL SYFSVIVVFL LRLWVDSLYV VNDLNTSIVA NQLFNIREIK RTYPLISSGL LVADVISGFS LPWLLEFAKL NRVIMIACGV IMIGSAILCY LSYQYRSSFP EAPQRLIPQE QASRHRRIQA PLKRYTLQLF AFVGLLQIIG LLVDFQYLQE LKINLGDREL AGFLGIFGGI VGLCELGTQW FVSSRLIERF GVFFTAALLP VAVGFVVPGM IVVLYLLPGI QSLAFFWGLV GLKFFDELLR YTFVISSGPS LYQPIPERIR SRMQALSGGT AEAIATGTAG IIIVITLFVC GLFVPATMQK WVFICETMVV AVACLKVVWL LRSRYVDLLV LSADRGELSA SNVGLRAFKQ GVVKALEEKG NTADKHSCIE LLAQIDPQGA ADVLSPILFK LTPDLQRHSL EVMLGADANP THVSEVKWLL ERHPDNVNPE VFALALRYVW LADPNPNLSQ LEEYLNQRHH SLTRATAAAL LLRQGTPIQK VAATKTLSRM LTHKQERERV NGVKALREVV YLQTLRIHIP NLLQDESLRV RCAVLEMIAA TRMEEYYSSL LTALYYKSTR ATAMRALIKM ENEALDMLLQ LATNAHKPEV LRMYAWRTIG QISTTEAVET LWLQLESSWG ATRYHILRSL LKIQKQSEIT NLVDRFQHSR VESLIDQELR FLGEIYAAYI DLRPQLNLDN NQTNSRALIV ADLLQRALAE MEWDIRERLL LLLKLLYSPD KMQAAAFNLR SESMVNLARG LEILEHTVNL PRKSLLLNLL DKRSHQEKLH DLLEAEFTEY QPLSASERLR KMLTLGTFLS DWCLACCFHF AQVNRIRLTI NEILMALRHP TGFVREAAIA YLSVVSHRVL LELLPKLKKD SHPLVAAQIQ ELIKKSAIKI NQ
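The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the blocks can be joined, the GC fraction computed, and the nucleotides translated to compare against the protein sequence. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons (table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG).

```python
# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGGAATTGA AAAATTACTC GTTGGCGGCA"
print(translate(first_blocks))  # MELKNYSLAA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.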