Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_3525 |
Symbol | |
ID | 3679549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 4378716 |
End bp | 4381223 |
Gene Length | 2508 bp |
Protein Length | 835 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637718877 |
Product | heterocyst differentiation protein |
Protein accession | YP_324027 |
Protein GI | 75909731 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.857828 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCAGG AATTTCACAT TTCTGTAACC CCAGTAGGGC AGAATGACTA CTTGGTGCGG ACGGAAGAAG TCGCGCCTGG GGTACCTTTG GCAGAAGAAC TGGTGACATG GCCTGTAGCT GATTGGTTGG CGGCGGCTGG GCATTTGATG AATGACCCAT TGAAATCAGT TTTGCAGGGA GATGCGTTTC TTTCTATGGG GCGGGAAAGT GCGATCGCCC GCAACTCTGT TAATTTAGTA GCATTAGGTC AACAATTATA TAACGCACTG TTTCAAGGTA CTCTCAGAGA TAGCTTGATT ACAGCCCAAG GTATTGCTCA AAACCACCAA CAAGTACTAC GTTTACGGTT GGGTCTCAAG GATACTAGGT TAGCACGTCT GCCGTGGGAA GTGATGCACG CAGGCGATCG CCCCCTAGCC ACAGGCCCTT ATATTGCTTT CTCTCGCTAC CAAAGTGGTA TTTCCCCAAC TTCCCGTGTA CCTTCAGCCA ACAGACTCAA GCTACCAGAA GATGGGGTTG TGAGAGTTTT AATGATCCTT GCATCACCCA CAGATCAAGC GAGTTTGGAT CTGCTGAAAC AAGAATCTAT TCGACTGCAA GCGGAACTAC ATCGTCAGCT ACCGAGATCA ATTGAAGGTG GTAATTATCT CCCAGAAATT GATCTCACCC TACTCAACCA GCCAGGGAGG GAAGAAGTAA CCCAGGCTTT AGAACAAGGC AGATATCATG TTTTACACTA CTCCGGTCAT AGTAACTTGG GGCCGAATGG TGGGGAAATT TATTTGGTTA GTAGCCGGAC TGGCTTAACA GAAACCCTAT GTGGCGACGA TTTAGCGGGT TTGTTGGTTA ATAATAATAT CCAAATGGCG GTGTTTAACT CCTGTTGGGG TACCTACACC GCTAGCTTTG ATAATAGTGG CGACACAGGA GAACGCAACC TTACAGATAG TTTAGTTAAG CGGGGTATTC AAAGCGTCTT GGCCATGTCG GAACGTATTC CTGATGAAGT GGCGCTGACT CTCACGCAAT TGTTTTATCG AAACTTGAGT CAAGGATATC CTGTAGATTT ATGTGTGAGT CGGGTACGTC AGGGATTAAT TTCTGCCTAT GGTTCACACC AGCTTTACTG GGCATTACCA ATTTTATATC TCCAACCGGA ATTTGACGGT TTCCTTAGCC CCAAACTCTC TGCGGCTACA AGTGTTGGTT CCCTGGATGA ATATAGTTCA TCTTTAGCAG CAAATACCGC ATCCACTTAC TCTGGTGTGC TGGATGATGG GGAAATGTCT TTACCAATTG AGGATATGAT GCCCTCTGGT TTAGTGCATG ACTCTTCAGG AGTTGACTGG TTAGGGGAAG AAACTTGGGG TGATCTCGTG GATGAGATTG AGTATGATGA CCCAAGTTAT GCCGAAGATT CGGCTTTTGT TTCCGATATA TTTCGCCAAC TAGACCAGCA AATTATAGGG GATGAAGAAA CTGAAGTACC TCCAGAAGTT AGACAACCTT TACCAGATAG TCATCTAGAA CGACCGATTG CAACTGCACC CAGGGAAGAT TTCTCCAGAA TTGTACCTCC GGCAACTCAT CATACTGCTC AAAACCTGAA CCAAGACTTA GAAAATTTCC GCCTCCTGGC ATCGAGAAAT CGTGTCCGTC GCCAGCGTTG GCAGATTTTC GGGATGATTG GGGTGGGTGC GATCGCCATT ATACTGATCT TTAGTTGGTG GCAAAGTCGC CAGACATCAG TAGTACGCGA CATACCCCCC ATTCCCACCC CATCTTTACC AGTTGAGACT CAGCCCCCAA CGGATTTACG CCAAATGCCT ACGGGAATGG TGACAGCCAT TGCCACAGAA AAATTGAATC AAGGTGATTT GGAACCAGGA TTGGCAGCTG TAGAAGAATT ACTCAATCGC AACGCACTAC AACCCGCCCA AACGGCATTA CAATTAATTC CTGCCAACCA AGAAAAACAG GCATCAGTTA ACTTCCTCAG AGGAAGATTA GCTTGGCAGT CTATCCAAAC AGGAGATAAA AAATACAGTA TTGATGATGC CCGTCGTTAT TGGGAAAGAG CGGTGAAAGC TAACCCAAAA TCGCTTTCAT ATATCAATGC TTTAGGATTT GCCTATTATG CTGAGGGTAA TATCAATCGA GCCAATAACT CTTGGTTTCA AGCAATAAGT TTAGGACTTA AACAAGTGAA TACTGCTGAT GCTGCCGAAG TTTCCCCTAA AGCAGATGTA CCGATTGAGG CTTTACCTGC CTATGCTGGT TTAGCATTGG GGATGTATAA AAATGCCCGT AACCGTAATT TCCCTCCCGA TAAACAGGCG CAATATATAA ATGAAGCCCT CAAATTACGA CAAACAGTTT TAGAAAAAGA TCCCATCAAT TTTCAGCTAG ACGAATTGAG CAAAAATTGG CTATGGACAG AAAATAGTCT CCGAGACTGG CGATCGCTTC TGCAACAAAA AAGCCCCAAG CAAAGTAGGG GAAGATGA
|
Protein sequence | MSQEFHISVT PVGQNDYLVR TEEVAPGVPL AEELVTWPVA DWLAAAGHLM NDPLKSVLQG DAFLSMGRES AIARNSVNLV ALGQQLYNAL FQGTLRDSLI TAQGIAQNHQ QVLRLRLGLK DTRLARLPWE VMHAGDRPLA TGPYIAFSRY QSGISPTSRV PSANRLKLPE DGVVRVLMIL ASPTDQASLD LLKQESIRLQ AELHRQLPRS IEGGNYLPEI DLTLLNQPGR EEVTQALEQG RYHVLHYSGH SNLGPNGGEI YLVSSRTGLT ETLCGDDLAG LLVNNNIQMA VFNSCWGTYT ASFDNSGDTG ERNLTDSLVK RGIQSVLAMS ERIPDEVALT LTQLFYRNLS QGYPVDLCVS RVRQGLISAY GSHQLYWALP ILYLQPEFDG FLSPKLSAAT SVGSLDEYSS SLAANTASTY SGVLDDGEMS LPIEDMMPSG LVHDSSGVDW LGEETWGDLV DEIEYDDPSY AEDSAFVSDI FRQLDQQIIG DEETEVPPEV RQPLPDSHLE RPIATAPRED FSRIVPPATH HTAQNLNQDL ENFRLLASRN RVRRQRWQIF GMIGVGAIAI ILIFSWWQSR QTSVVRDIPP IPTPSLPVET QPPTDLRQMP TGMVTAIATE KLNQGDLEPG LAAVEELLNR NALQPAQTAL QLIPANQEKQ ASVNFLRGRL AWQSIQTGDK KYSIDDARRY WERAVKANPK SLSYINALGF AYYAEGNINR ANNSWFQAIS LGLKQVNTAD AAEVSPKADV PIEALPAYAG LALGMYKNAR NRNFPPDKQA QYINEALKLR QTVLEKDPIN FQLDELSKNW LWTENSLRDW RSLLQQKSPK QSRGR
|
| |