Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0441 |
Symbol | |
ID | 3682602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 563563 |
End bp | 566454 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637715770 |
Product | PEP-utilising enzyme, mobile region |
Protein accession | YP_320962 |
Protein GI | 75906666 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0344] Predicted membrane protein [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.299176 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGAAC TTTGGGGTGC CTTAGTTATA TTAATTGTCT GTCCCTTCTT GGGCGCGTTA CCTGTAATTG CTTGGATTAC TTACGCGCTC AAGAAGAGAC GTTTAGCTCA AATAGGTACA AGAAATATCA GTGTCTCCGC AGCTTTTTAC CACGGTGGCA CAATTGCTGG GATTTTAGCG GTTGTATCAG AAGCCCTTAA AGGAGTCGCT GCAATTTATC TTGCTCGTGC TTTCTTCCCT GAGGGGTCAT TTTGGGAATT GCTTTCCCTG ATAGCTTTGG TACTCGGTAG GTACTTTATG GGCAGAGGGG CGGGGACAAC CAACGTAGTT TGGGGATTAT TAGTACATGA TCCCCTGCTA ACAATTTTTG TGAGCCTGTT GGCAATTATC AGCTTCACCC TGTTGCAGTC GAGAAATGTG GTAAAGTACG GGGTCTTATT TGTGTTTCCT TTATTTGTGG TGCTTCTCCA CGCCGAAGAC TTTCCTAAAA TTATTAGTGC TGTAGCACTA GCGGGATTGT TATGGTGGAT TTATAAGAAA ATTCCTGACG ACTTGGATTT GTCTTCCCAA GAGGTAGATG CAGAGTCACA AGGCGCATTT GAATATTTAC AGGGCAATGA TGTCATCCTC AGTTTAGATG ATGAGTTAGA TCCGGCGATC GTGGGACACA AAGCCGCTAC TTTATCTCAA ATTAAGCGCT GGGGTTATCA AGTACCAAAG GGTTGGGTAC TCACCCCTGG AGACGATCCA GAAAAGTTAT TAGAATTCCT CCAACCTTCC GAATTATCAC CCATAGTTGT CCGTTCTTCC GCCATTGGGG AAGACTCAGA ACAGGCTTCG GCGGCTGGGC AATATTTAAC AGTGATCCAA GTTGCCAGTT ATCAGCAACT ACAACAAGCC ATTACAGAAG TCAGAGAATC ATATAATTAT TCACCGGCTG TGCAGTATCG GCGCGATCGC GGTTTACCCG ACACAGCCAT GTCAGTCCTG ATTCAACAAC AAGTCCAAAG CGCCTATTCT GGGGTAGCTT TTAGCCGTGA TCCTATTACC CAGCAAGGTG ATGCGGTGAT TATCGAAGCC CTACCCGGTA GCCCTACTCA AGTTGTTTCC GGCAAAGTCA CACCAGAACA ATATCGGGCT TTTGTGCTGG AGGCCGATAA TTTGTCTTCG GTGAAACTAG AAGGTACCGG AAGAGTACCC CAGGCATTAA TTAAACAAGT GGCTTACTTA GCTCGTCGGC TGGAAAAGCG TTATCTGGGA GTACCTCAAG ATATCGAGTG GAGTTACGAC GGTCAAACCC TGTGGTTATT GCAAGCAAGA CCAATCACCA CCTTATTACC CATTTGGACA AGGAAAATCG CGGCGGAAGT GATTCCAGGT GTGGTGCATC CCTTAACTTG GTCGATTAAT CGTCCCTTAA CTTGTAGCGT TTGGGGTGAT ATTTTTACGA TAGTGTTAGG CGATCGCTCT ACAGGATTGG ATTTTACAGA AACGGCAACC CTGCACTACT CTAGAGCCTA CTTTAACGCC TCTCTTCTAG GAGAAATTTT CCTCAGGATG GGATTACCGC CAGAAAGTCT AGAGTTTTTA ACGAGGGGTG CAAAAATCAG TAAACCGCCG TTGCAGTCCA CCTTACAAAA TCTGCCGGGA TTATTCAAGT TACTGAAACA AGAACTCAAT TTAGAGAAAG ACTTTAAACA AGATTACCAA AAGGTATTTA TTCCGGGGTT ATCTCAATTA GCCAATGTTT CCCTAGAGGA ACAAGAGATA GGAGAACTGC TAGCCGGGAT TGATTTCAAC CTAGAATTGA TGCGCCGTGG CACTTATTAC AGCATTTTAG CTCCCCTGAG TGCCGCTATC AGACAGGGAG TTTTTCGGGT GAAAGATGAG CAAATTGATA ACAGCGTCAC CCCAGAAGTA GCCGCTTTAC GCTCACTCAG AGCTTTAGCT GTAGATGCCA AACAGATATT ACCAGAGTGT GAACCTGAGC AAGTCTTCGA TACATTGGCG CAAGTCCCAG GGGGAGAAAA AATCCTCTAT GAATTTAACG AATTATTGGA AGATTACGGT TATTTGAGTG ATGTCGGCAC AAATATCGCT GTCCCCACTT GGAAAGAAGA CCCCCAACCC ATCAAACAGT TATTTGTCCA GTTAATTCAA CTCAGTGAGC CAGAAAAAGC CGAATTAGAA GCCAAAAAAG TTGTCGCCCC GAAACGCAAA CGGGGGACTG TACAACGACG AGTAGATATT AAAGGGCGAG TCACCGAGCT TTATTCGCGC CTATTAGCCG AATTACGGTG GAGATTCGTG GCTTTAGAAA AAATTCTGCT GAAATCAGGA GTACTCAAGC AAGTAGGGGA TATCTTCTTT TTAGAACTCG ATGAATTACG AGATTTATTA GCAGATACCA ATAATGAGTT AAGAGTTAGC TTAAACGAAC TAATCCAATT TAGGCGATCG CAATTCCACC AAGACAGTCA AATTGAACAA GTCCCCCTGG TAGTCTACGG TAATATACCC CCCCATCCTT CAGAAACCAC AGACGTATAC TCTGACCAAA TATTACAAGG TATTGCCGCC AGCCACGGAC AAGCCGAAGG CAGAATCAAA GTGGTGCGAA ACTTACAGAA CTTACCAGAC ATCGATAAAG ATACAATACT AGTAGTACCC TATACAGATT CCGGCTGGGC CCCTCTCTTA GTCAGAGCCG GAGGATTAGT TGCAGAAGCC GGCGGTAGAC TTTCCCACGG GGCGATCGTC GCACGAGAAT ACGGTATACC TGCGGTGATG GATGTTAAAG GCGCAACCTG GATTCTGCAA GATGGTCAAC GAGTCAGAAT CGACGGGTCT AGGGGGATTG TGGAACTATC CAACGATTTA CGACCAGAAT GA
|
Protein sequence | MRELWGALVI LIVCPFLGAL PVIAWITYAL KKRRLAQIGT RNISVSAAFY HGGTIAGILA VVSEALKGVA AIYLARAFFP EGSFWELLSL IALVLGRYFM GRGAGTTNVV WGLLVHDPLL TIFVSLLAII SFTLLQSRNV VKYGVLFVFP LFVVLLHAED FPKIISAVAL AGLLWWIYKK IPDDLDLSSQ EVDAESQGAF EYLQGNDVIL SLDDELDPAI VGHKAATLSQ IKRWGYQVPK GWVLTPGDDP EKLLEFLQPS ELSPIVVRSS AIGEDSEQAS AAGQYLTVIQ VASYQQLQQA ITEVRESYNY SPAVQYRRDR GLPDTAMSVL IQQQVQSAYS GVAFSRDPIT QQGDAVIIEA LPGSPTQVVS GKVTPEQYRA FVLEADNLSS VKLEGTGRVP QALIKQVAYL ARRLEKRYLG VPQDIEWSYD GQTLWLLQAR PITTLLPIWT RKIAAEVIPG VVHPLTWSIN RPLTCSVWGD IFTIVLGDRS TGLDFTETAT LHYSRAYFNA SLLGEIFLRM GLPPESLEFL TRGAKISKPP LQSTLQNLPG LFKLLKQELN LEKDFKQDYQ KVFIPGLSQL ANVSLEEQEI GELLAGIDFN LELMRRGTYY SILAPLSAAI RQGVFRVKDE QIDNSVTPEV AALRSLRALA VDAKQILPEC EPEQVFDTLA QVPGGEKILY EFNELLEDYG YLSDVGTNIA VPTWKEDPQP IKQLFVQLIQ LSEPEKAELE AKKVVAPKRK RGTVQRRVDI KGRVTELYSR LLAELRWRFV ALEKILLKSG VLKQVGDIFF LELDELRDLL ADTNNELRVS LNELIQFRRS QFHQDSQIEQ VPLVVYGNIP PHPSETTDVY SDQILQGIAA SHGQAEGRIK VVRNLQNLPD IDKDTILVVP YTDSGWAPLL VRAGGLVAEA GGRLSHGAIV AREYGIPAVM DVKGATWILQ DGQRVRIDGS RGIVELSNDL RPE
|
| |