Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_1848 |
Symbol | |
ID | 3681839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 2300833 |
End bp | 2304090 |
Gene Length | 3258 bp |
Protein Length | 1085 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637717188 |
Product | hypothetical protein |
Protein accession | YP_322365 |
Protein GI | 75908069 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.47429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAAG TAAGTGCAAA TGAACTGCAA TATCGTGGGA TTAACCGCAC AATTTGTATT GGCTTAGGTG GTACTGGACG AGATGTTTTG ATGCGAATTA GACGGTTAAT TGTTGACCGT TATGGAGATT TAAGCAATCT GCCAATTGTA AGTTTTGTTC ATCTAGATAC TGATAAAGCT GCAACACAAG TGACTGGCAT TCGTACAGGA AGTACTTATC ATGGTGTTGA TCTCAGCTTT CGAGAAGCCG AAAAAGTTAG CGCCACTATG TCCGCCAAGG AAGTAACGAT GTTTGTGGAA GGACTAGAAA GGCGCTCAGA ATATACTCGT TACGGCCCCT ACGACCATAT TGCTAGATGG TTTCCTCCCC AACTGTTGCG AAATATTAAA GCTGTGGAGG AAGGTGCAAA AGGAATTAGA CCTGTAGGGA GACTAGCTTT TTTTCATAAT TATCAAAAGA TAAAAATAGC GATTGAAACC GCAGAAAGAC TTAGTAGGGG ACATGATGCT TTATTGCTGA GAAAGGGGTT AAGAGTTGAA CCAGGATTGA ATATTTTTGT GATTGGTTCT CTGTGTGGTG GTACAGGGAG CGGTATGTTT TTGGATGTTG CTTATAGTCT TAGACATCTT TATGGTGAAC AAGGCGCTCA GATTGTCAGC TATTTAGTGA TTAGTCCAGA ATTATATGGT AATACCCCTA ATATGAGTGC TAATACTTAT GCTGCTTTGA AAGAGTTAAA TTACTACAGT ACTCCAGGGA CAAAATTTGC AGCCTGTTAT GATATTGAAA ATCTAGAATT TCTACAAGAA AAGCGTCCGC CTTTTGACTA CACTTATTTA GTTTCTCATC AGACAGGAGG CGAATATCAA ATTCTTGATC AAGGTAAGTT ATGTAATGTG ATCGCTCACA AGATAGCTCT AGATTTTTCC GGTGAGTTAG CACCTGTAAT TAAAGGACAT AGAGATAATT TTCTCCAACA TATAATTCAG TGGGATAAAC ATCCACGTCC TAATGGTCAG AGGTATTTAA CATTTGGGTT AGCGGCGATT TATTTTCCCC GTGACACTAT CGTGGAAATT GCCTTAATAA GGGTTAGTTT AGCATTAGTA AAGTTTTGGT TAAATGGCAA AGGTCAAAGT CCAGATCCTC AGAAACTACT GGATCAATTT CTGATTCAAT CTCGTTGGCA TAATGACTTA GCCAAAAAAG ACGGCTTAAC TACGAAAATA GCAGAATCAG TAGAGGATAC AAATAAAAAC TTTAGTAGCA ATATTAGTAC CTGGAGAAGT AAATTAGAGC GATCAATTTC TGAATGTCAG AATAAAGATG ATCGTAACGG TATTCGTCAA CAGTTACCAA GGGAGTTTCG AGAGCAATTT CGGAAAGTGC AGCCGGGGGA AACAGAAAAT GTCCGAGGTA TTTGGCTGAC AAAATTGCTC CAGTCTTCTC CAAATATCAC CAAGGAACTA AAGACTAATA TTGACGATTA TTTAATTCAG TTACTCACGC CAAGTGAGCC TATTTTCTCT ATTAAAAGCA GTCGTGATTG GCTAGATGCT TTACAACATG AACTACATAA CTATCAATTC AATCTGCAAG AAGCAATTAC CGATTTTGGT GGGATGAAAC GCGCGGAGGA TATTGATAAA AAATGGCGAG ATGCCGAGCA AATGATTGAA GATATTGAGC ATAAAATTGG TATTCCCATA ATTAATACTA AGAATAGCCA AGTGCAAGCT GAAGTTAAAA GGGTAGTGCA AGAAGTCTGC AAACTCATTA AACATAACTT TGATTTTACC GTCTTTCAAG AGGCTCTAAA AATAGTCAAT GAATTACAAA AACACGTTCA GGAAAGAGGG AATCAAGTTA CTGCTTTTAG TAGAGTCATT GAAAATTTGC AAACTTTCTA TGAGAAGCAA GATAGTGATT TAAGACAGTT AAACTTTGAT GAAATGAGTG GAGAAGCCAT ATTTGATAGT GAAGATATTG ATCGCTGTTA TCAAACTATG TTGCCAGAAG ATGATCTTCG CAGACAATTG GTATTAGCTA GCTCGGAAAT TACGGAACCT GCTGGAAGGG GACAATCTTT GGCAAGTTTT ATAGATAGAG AAAGAACTAC GCCAGAACAG CTACAAACAG AAATTGACCT AAAGGTTGAC AGTTTATTTG CTTCTCGCGT TACTAATATT GTCAACTCTG TGATTAAGCG TTTCATGCAA AAATATCCTT TAGCAGCGCG TTCGACTCGG TTAGCGCAAG TTATGCAAGA AGCTGAACCT CTGCTGAGGC TGAATTTAAG TGACCCTTAT TTCCGTGAAG ACCCGGCGAA AAGTAGTAAA TTAATTGGGT TTAAGGATAA GGATGAATTG GAGGTACGAC AGTTTAAAAC TGTATTAGCA CAAGATTTAG GTATTGAATC AAGTGTGATA AAAGCGACAC AATCTGAAGA TGAGATTTTA ATTGTCAATG AGTATGCTGG TTTTCCTCTC AGGCTAATTA GTAGTCTGGA GAGGATGAGA AACCCCTATC TACGTGAACA AAATTCTGCC ACATCTTTTC TGCATAACGA TTACCAAGTA GCATTTCCAG ATATTATCCC CCCAGATGCG ATCGCAATGG AAAAACTGGA AGATGTCTTC TATCCTTGTT TGGCCTTTAG GTTACTCAAG GAAAACCAAG AAAATCAACA ATTAGAATTT CAATATTATG ATTCCTTGCG TGATAGTTAC AATACTGCTA CTTTGAGTCC AGAGTGGAGT CAAGCCTTGG AAGAATTAGC TAACCGCAAC GACATGACTG AGGCTTTGCT ACAGCTTTTA GAGCGAGAAA TTTCTGTAAT TTCTGGACAA CCAGAACTTT GGGAAAATCA GTATTTACCA AAACTAAGGC AATTTGTGCA GGCAGTAGAT GATTTATCAG AAGATAGTCC CAATTATCCC TACAAACTCG CAGTAGTAGG AACATCCGCC AGCACAGATC CTACAGTTAA AGAAGGAATT ATTCATCGCT TTCGGAGAAA AATGAATGAG CGATTTAGCA TATCTCAAAG TCGCGCTTTT GCACCAAATA ATAATACATC AATGCAAACA GCTATTGCTG GTGAAATAGT CGTGGATATG CCTGTTGATA CTACTGATAA TAGAGTCAGG CGGCGCTTAG AATTAGAGCG GTTGAAACAA GATTTAGATG AAGATTTTAT TACTCAAGAT GAATATGAGC GTGAAAAACA AAGGATTTTT GCTCAATATC CCCTTTAG
|
Protein sequence | MNQVSANELQ YRGINRTICI GLGGTGRDVL MRIRRLIVDR YGDLSNLPIV SFVHLDTDKA ATQVTGIRTG STYHGVDLSF REAEKVSATM SAKEVTMFVE GLERRSEYTR YGPYDHIARW FPPQLLRNIK AVEEGAKGIR PVGRLAFFHN YQKIKIAIET AERLSRGHDA LLLRKGLRVE PGLNIFVIGS LCGGTGSGMF LDVAYSLRHL YGEQGAQIVS YLVISPELYG NTPNMSANTY AALKELNYYS TPGTKFAACY DIENLEFLQE KRPPFDYTYL VSHQTGGEYQ ILDQGKLCNV IAHKIALDFS GELAPVIKGH RDNFLQHIIQ WDKHPRPNGQ RYLTFGLAAI YFPRDTIVEI ALIRVSLALV KFWLNGKGQS PDPQKLLDQF LIQSRWHNDL AKKDGLTTKI AESVEDTNKN FSSNISTWRS KLERSISECQ NKDDRNGIRQ QLPREFREQF RKVQPGETEN VRGIWLTKLL QSSPNITKEL KTNIDDYLIQ LLTPSEPIFS IKSSRDWLDA LQHELHNYQF NLQEAITDFG GMKRAEDIDK KWRDAEQMIE DIEHKIGIPI INTKNSQVQA EVKRVVQEVC KLIKHNFDFT VFQEALKIVN ELQKHVQERG NQVTAFSRVI ENLQTFYEKQ DSDLRQLNFD EMSGEAIFDS EDIDRCYQTM LPEDDLRRQL VLASSEITEP AGRGQSLASF IDRERTTPEQ LQTEIDLKVD SLFASRVTNI VNSVIKRFMQ KYPLAARSTR LAQVMQEAEP LLRLNLSDPY FREDPAKSSK LIGFKDKDEL EVRQFKTVLA QDLGIESSVI KATQSEDEIL IVNEYAGFPL RLISSLERMR NPYLREQNSA TSFLHNDYQV AFPDIIPPDA IAMEKLEDVF YPCLAFRLLK ENQENQQLEF QYYDSLRDSY NTATLSPEWS QALEELANRN DMTEALLQLL EREISVISGQ PELWENQYLP KLRQFVQAVD DLSEDSPNYP YKLAVVGTSA STDPTVKEGI IHRFRRKMNE RFSISQSRAF APNNNTSMQT AIAGEIVVDM PVDTTDNRVR RRLELERLKQ DLDEDFITQD EYEREKQRIF AQYPL
|
| |