Gene Information |
Locus tag | Ava_3571 |
Symbol | |
ID | 3679517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 4447156 |
End bp | 4449147 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637718922 |
Product | hypothetical protein |
Protein accession | YP_324072 |
Protein GI | 75909776 |
COG category | |
COG ID | |
TIGRFAM ID | |
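The coordinates and lengths in the table above agree with one another, which a short check makes explicit. This is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon left out of the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 4447156
end_bp = 4449147

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values match the reported Gene Length (1992 bp) and Protein Length (663 aa).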
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.68126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00704751 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGAAGTACG CAATACCAGA TGATATCCAA AAGTTACAAT TGAGGATAAA AAGACGATTC AATAAAACAT TTAAGTATTT AAATTATAAA TTTTTCAAAT CATGGATTTA TTTATTTATT GCTACTATCT CATTTCTATT TATTACTATT GTTTCAACAG ATTTAACATT CAGTCAAGTT CCAGAGGATA CCTATAAAAG AGATTTGATC AGTGATATTT TTGCTGGCAG TCAGAAATCT ATACAAAACT ATATTGAGGG AATGTATGTT GCGCCTAATG GCACAGTTTA TACAAATAGT CATGAGGATA AAGCAGGAGC CGAAGCATCA ATATATAAAG ATGGTAATGT AATTGGGGTT CTCAATGATC TTCATGGTTG GAGTCGTCAT GGTGGTAAAG CTGTCACAGC AAATAGTAAG TATATTTACA TTGCTATGAG CCAAGGATTT GTCGGCAAAA TAGACAAAGA TTATCCACCA GAGGGAACAA CTTGGTATTG TGTGAGACGT TATAACTTAT CTGGTAAACC AGCGCCCTTT GCTAATGGAC GTGGTTGGGA TAAGAGTATG TTAATCGTCA ATACTAAAAG TGAGGTGACT GGATTAGCAA CGAAAGATAA GAATTTATAT GTCAGTGATG CAGCGAATAA TAAAATTCGC GTCCATAACA CTGAGACAAT GAAAGAATTA CGCAACTTTA GTTTCCTGAG ACCAGGGGAT ATAACTATTG ATAAGCAGGG GAATTTGTGG ATTATTCAAA AGCAAGATGC CAATAATCCT GCTAAAATTT TGCGTTATTC TTCATCAGGA AAACAATTAC CGCAAAAAAT AACTAATGTT GTTTTACCCG GAGCGATCGC ACTTGATAAT CAAGGCAGAC TGTTAGTAGC GGAAAATGGG ACACTTCAGC AAGTATTGAT TTATAACATT CAGGATCAAC CAGTCCAGGT CGGCACTTTT GGCCACACAG GGGGGATTTA TGGAGGTGTT CCTGGTGAAG TCCAAGATTT AAAGTTTTAT GGACTCACAG GAGTCGGCGC AGATACTCAG GGGAATTTGT ATATCAATAA TAATGGGTTC AATAATTCCG GCACAGATTT AAGAAAATTT TCGCCATCAG GAAAGCAACT ATGGCGATTA TTAGGATTAT CCTTAGTCGA AAATGCCGAC GTTGACCCCA CTACAGATGG ACGAGAAGTT TTTACCAAGT ATGAACACTT TTTGATAGAT AGTAGCAAGC CTAAGGGTCA ACAATTGACT TATAAAGCTT ACACCTTAAA CGGTTTTAAA TATCCTCAAG ATCCACGACT GCATATATCC CCCGATGCTA CCTTTGTACG TCGAATCAAT GGTGAAAGAT TTTTGTTTCT CTTAGATCCG TACAGTAACG CCTTAGAGAT TTATCGATTT AATCCCAGCA AAGATGGGAA TATAGCTATT CCAGCAGGGA TGTTTGTTGG TAAAAATAAC GAAGGTAAAT CAGCAATCTC AGGGACTTGG CCACCTCAGC AACCAAAAAC AGGAAAATGG ATTTGGCGCG ATCGCAACGG TAACGGCGCT TTCGATAAAG AAGAATATGA TCATAGTGAA GATTCTTCCT CTGTTACAGG ATGGTGGGTG GATAGTAAAG GCGATGTGTG GACAACTCTG GGCGATCGAC AAGGTATTCG CCATTATTTC TTGCAAGGAC TAGACACTAA CGGTAATCCC ATCTATACCT ATAGTTCCAT GCAAAAAGAA ACAACTCCCC GCATTTTTAC AGATTTACGG CGAATCAAAT ACTTTCCTGA AAGTGACACC 
ATGTATCTGT CCGGCTTTAC AGTTAAAAAT CCAGCAACCT TTGTAGATCC CCTAGCCGCA GGCTCTGAAA TTGCCCGTTT TGACAATTGG AGTCAAGATA ATCGTATTCT CCGTTGGCGA ATTGTAGTTC CTAACGACAC CATCCGTAAA CGTGAAATTA TTACTGCGTC TACGCACTCA GTCCCAGATT AA
|
Protein sequence | MKYAIPDDIQ KLQLRIKRRF NKTFKYLNYK FFKSWIYLFI ATISFLFITI VSTDLTFSQV PEDTYKRDLI SDIFAGSQKS IQNYIEGMYV APNGTVYTNS HEDKAGAEAS IYKDGNVIGV LNDLHGWSRH GGKAVTANSK YIYIAMSQGF VGKIDKDYPP EGTTWYCVRR YNLSGKPAPF ANGRGWDKSM LIVNTKSEVT GLATKDKNLY VSDAANNKIR VHNTETMKEL RNFSFLRPGD ITIDKQGNLW IIQKQDANNP AKILRYSSSG KQLPQKITNV VLPGAIALDN QGRLLVAENG TLQQVLIYNI QDQPVQVGTF GHTGGIYGGV PGEVQDLKFY GLTGVGADTQ GNLYINNNGF NNSGTDLRKF SPSGKQLWRL LGLSLVENAD VDPTTDGREV FTKYEHFLID SSKPKGQQLT YKAYTLNGFK YPQDPRLHIS PDATFVRRIN GERFLFLLDP YSNALEIYRF NPSKDGNIAI PAGMFVGKNN EGKSAISGTW PPQQPKTGKW IWRDRNGNGA FDKEEYDHSE DSSSVTGWWV DSKGDVWTTL GDRQGIRHYF LQGLDTNGNP IYTYSSMQKE TTPRIFTDLR RIKYFPESDT MYLSGFTVKN PATFVDPLAA GSEIARFDNW SQDNRILRWR IVVPNDTIRK REIITASTHS VPD
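The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is an illustration only, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in their permitted start codons). The listed gene sequence is already in coding orientation (it begins with ATG), so no reverse complement is applied even though the gene lies on the minus strand.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Assumption: amino-acid
# assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
prefix = "ATGAAGTACG CAATACCAGA TGATATCCAA"
print(translate(prefix))  # MKYAIPDDIQ
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison against the listed protein sequence and the reported GC content.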