Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_3502 |
Symbol | |
ID | 3679775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 4342244 |
End bp | 4345483 |
Gene Length | 3240 bp |
Protein Length | 1079 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637718854 |
Product | Type I site-specific deoxyribonuclease HsdR |
Protein accession | YP_324004 |
Protein GI | 75909708 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.23995 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTAACT TTATTTCTGA AGATGACATC GAACAAGCGG TACTTGCCAA ATTAAAGCAG CAAGGCTTTA AATTACTCAA CTGCTTCACA ACCAACCCCG ACGATCTCAA CGACAGATCC AACCGCACAG ATAAAAGCGA GGTCATTTTT AGCGATCGCC TGAAAGCAGC AGCAATTCGC CTCAACCCCA ACTTACCAGC AGAAGCGATC GCCCAAGGTT TAGAAACTCT TACCAATAAA CGCCAGTTGA TGTCTCCCAT CGCCGCAAAT CGGGAAATAG ACAGGCTAAT TCGTGATGGT ATCCCCATTG AATATGAAAA CGCCCAAGGT AGAAGCGAAC ACGGGAAAGT GCGCTTAATA GATTTTAATG ACTTGAGTCA CAACGGAAAT AATGAATATT TAGCCGTTTC CCAACTTTGG ATTAAAGGAG AAAGAAACTA TCGCCGTCCT GATATCTTGC TATATATCAA CGGTTTACCC CTGGTTTTCA TCGAACTAAA AAACTCCAAC GTCAAACTGC AAACAGCCTT TGACGACAAC CTCAGCAACT ACAAAAAAGA TATTCCTCAA CTCTTTCTCA CCAACGCCTT CTGCATTCTC TCCAACGCCC TAGAAACCAA AATTGGCAGT TTTACCGCAG AGTGGGAACA CTTTTTTAAC TGGTTACGGG TGGAAGATGA AAAGGAAAAA ATAGATCGTC AGCAGATTCA AGAATCTGGT ACTAGCCTAG AACTAGCTAT AGATGGGCTT TGTCACCCTT CTAAACTACT CGACTACATC GAAAACTTTA TTCTCTATCA CAAAGAAACT CAGAAAATCA TCGCCCAAAA CCACCAGTTC ATCGGTGTGA ATCGCGCTAT TGATGCTTTT GACCACCGGG AAGTAAAACA GGGCAAATTA GGAGTTTTCT GGCACACACA AGGCTCTGGT AAAAGCTTTT CGATGATTTT TTATGCCCGA AAGATTTTCC GCAAGCTCAC GGGCAATTTT ACCTTTGTGA TTATTACCGA CCGCGATGAT TTAGATGGGC AGATTTACCG CAATTTCTTA AACACCGAAA CAGTCAAAGA GAAAGAAGCA GCTCGACCCA AAAACAGTAA ACAACTCCGG GAGTTTCTTT CCCTCAACAA GCGAATAGTT TTTACCCTGA TTCAAAAGTT CCGCTACGAG AAGGGGGAGG AGTATCCCAT ATTGAGCGAT CGCAATGACA TTATTGTAAT TGTCGATGAA GCACACCGCA CCCAGTATAA AACCTTAGCG GAAAATATGC GGGCAGGTTT ACCCAATGCC AATTATCTCG CCTTCACAGG TACTCCCCTA TTAGGTAAAA AGCGCAAGAC TAACGAATGG TTTGGGGATT ATGTCAGCGA GTATAATTTT TCTCAGTCAG TAGATGATGG CGCAACCGTC CCCCTGTTTT ACCAGAAGCG AGTACCAGAG GTACTGATTC AAAACGAGGA TTTAAGCGAG GAATTTTACC AAATCCTTGA AGATGAAAAC CTTGATGACA CCCAGCAAGC CAAGCTAGAA AGGAAATTTG CCCGCGAAAC CGAAGTTATT AAACGTGATG ATAGGTTAGA GACAATAGCA AAAGATATTG TTTACCACTT TCCTCGTCGT GGCTATCTTG GTAAAGGCTT GGTGGTTTCT CTTGATAAAT TCACTGCGGT GAAAATGTAC GACAAAGTAC AGTACCACTG GAAAGCAGAA ATTAAAAATC TGGTGGGACG AATTAAGAAA TCTAGTAATG ATATTGAGAA AATTCGCCTG AAAAAGATTC TCGACTTTAT GCGAGAAACA GAAATGGCTG TGGTGATTAG TCTGGAAGGA AGCGACAGCG AAGTTGAAAA ATTTGCGAAA CAAGGACTTG ATATCAAACC CCACCGCGAA AGAATGAATA CCGTTGATGC ACAGGGACAT GATATCGAGT ATCAATTTAA AGATGAGGAA AATTCCCTGC GGTTGGTGTT TGTCTGCGCC ATGTGGCTGA CGGGTTTCGA TGCGCCCACT CTCTCGACGC TTTACCTTGA CAAGCCCATG AAAGACCACA CGCTGATGCA AACTATCGCC CGTGCGAACC GCATCACCTC ATATCAGATT CACAACGTCA CCAAAACCAA CGGCGAAATT ATTGATTATT ACAACGTCTT CCGCAACATG AAGCAAGCCT TGGCAGATTA CGCTTTGGGC GATGATGGAG ACAAAGAAGA CTCACCCATT CAAGAAAAGT CTAACCTGTT TGAGTTACTT GATGATGCGA TCGCCCAAGG ACTGAATTTT TGTCGGGAAC TGGGGATTGA TTTAGAAAGC ATCCTCAAAA TTCAGGAAAC TTTCGAGAAA TTAGGACAAT TTAACCAGTT TGCTGACACT CTACTGCAAA AAGATGAATG GCGTAAAGGA TTTTTTGTCT ATCAAAATAC TATTACCTCA CTCTATGAAG CCTGTAAACC AGAGATAATC AAGAAACCAC GTCCATTGGT TTTCATCTTC CAATATCTGC GAGGGGTGAT TGATAGCATC ATTGAACGCA AAGATATTGA CCATGCAAGT TTAAAAATTG CGGAACTGCT AGATGAAAGC GTAGTTGCTG ATAATCAAGG AATTACCACT AAAGAATATA GTGCAGAATA CAACATTATC CAGACAGGTC AAGTCTGGGA TTTAAGCAAA ATTAACTTCG AGCAACTCAA AGCCGAATTT AGCTACAAAC CCTATAAAAA TATTGAAATT ACCGATTTAC GCAGTTTTAT TGAAGACAAG CTGAACAAAA TGATTCAGCA AAACACCACC CGCACCGACT TCGCCCAAAG GCTACAAGCC ATCATTGACA AATACAACGC TGGTGGTTCA TCAATTGATA ACTACTACGA AGCTCTGGTT AATTTTACAG AAAACCTGAA AGCAGAAGCA GAAAGACATA TTCGGGAAGG CTTAACAGAA GACGAATTGG AACTATTTGA TTTACTCAAG AAAGACAAAA TGACTGCGGA GGAAACTCAA AAAGTCAAAT TAGCTGCACA GTCTCTCTTG CATCGACTCA TAGCAGAACA ACCAAAAGTC TTAGTGCAAG ATTGGTATAA AGATATGCAG AGCCAGCGCA GAGTGCAATC CGTAGTTGAA GCAGTTTTAG ATCAAAACCT GCCTGAAAGC TATGATAGAG CGTTATTTAA GTTTAAATGC GATACAGTAT TTGATTTGAT GTATGACCAC GCTAGCAAAG GTTCAAAGTG GGCGGGATAA
|
Protein sequence | MPNFISEDDI EQAVLAKLKQ QGFKLLNCFT TNPDDLNDRS NRTDKSEVIF SDRLKAAAIR LNPNLPAEAI AQGLETLTNK RQLMSPIAAN REIDRLIRDG IPIEYENAQG RSEHGKVRLI DFNDLSHNGN NEYLAVSQLW IKGERNYRRP DILLYINGLP LVFIELKNSN VKLQTAFDDN LSNYKKDIPQ LFLTNAFCIL SNALETKIGS FTAEWEHFFN WLRVEDEKEK IDRQQIQESG TSLELAIDGL CHPSKLLDYI ENFILYHKET QKIIAQNHQF IGVNRAIDAF DHREVKQGKL GVFWHTQGSG KSFSMIFYAR KIFRKLTGNF TFVIITDRDD LDGQIYRNFL NTETVKEKEA ARPKNSKQLR EFLSLNKRIV FTLIQKFRYE KGEEYPILSD RNDIIVIVDE AHRTQYKTLA ENMRAGLPNA NYLAFTGTPL LGKKRKTNEW FGDYVSEYNF SQSVDDGATV PLFYQKRVPE VLIQNEDLSE EFYQILEDEN LDDTQQAKLE RKFARETEVI KRDDRLETIA KDIVYHFPRR GYLGKGLVVS LDKFTAVKMY DKVQYHWKAE IKNLVGRIKK SSNDIEKIRL KKILDFMRET EMAVVISLEG SDSEVEKFAK QGLDIKPHRE RMNTVDAQGH DIEYQFKDEE NSLRLVFVCA MWLTGFDAPT LSTLYLDKPM KDHTLMQTIA RANRITSYQI HNVTKTNGEI IDYYNVFRNM KQALADYALG DDGDKEDSPI QEKSNLFELL DDAIAQGLNF CRELGIDLES ILKIQETFEK LGQFNQFADT LLQKDEWRKG FFVYQNTITS LYEACKPEII KKPRPLVFIF QYLRGVIDSI IERKDIDHAS LKIAELLDES VVADNQGITT KEYSAEYNII QTGQVWDLSK INFEQLKAEF SYKPYKNIEI TDLRSFIEDK LNKMIQQNTT RTDFAQRLQA IIDKYNAGGS SIDNYYEALV NFTENLKAEA ERHIREGLTE DELELFDLLK KDKMTAEETQ KVKLAAQSLL HRLIAEQPKV LVQDWYKDMQ SQRRVQSVVE AVLDQNLPES YDRALFKFKC DTVFDLMYDH ASKGSKWAG
|
| |