Gene Ava_3502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3502 
Symbol 
ID3679775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4342244 
End bp4345483 
Gene Length3240 bp 
Protein Length1079 aa 
Translation table11 
GC content42% 
IMG OID637718854 
ProductType I site-specific deoxyribonuclease HsdR 
Protein accessionYP_324004 
Protein GI75909708 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.23995 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAACT TTATTTCTGA AGATGACATC GAACAAGCGG TACTTGCCAA ATTAAAGCAG 
CAAGGCTTTA AATTACTCAA CTGCTTCACA ACCAACCCCG ACGATCTCAA CGACAGATCC
AACCGCACAG ATAAAAGCGA GGTCATTTTT AGCGATCGCC TGAAAGCAGC AGCAATTCGC
CTCAACCCCA ACTTACCAGC AGAAGCGATC GCCCAAGGTT TAGAAACTCT TACCAATAAA
CGCCAGTTGA TGTCTCCCAT CGCCGCAAAT CGGGAAATAG ACAGGCTAAT TCGTGATGGT
ATCCCCATTG AATATGAAAA CGCCCAAGGT AGAAGCGAAC ACGGGAAAGT GCGCTTAATA
GATTTTAATG ACTTGAGTCA CAACGGAAAT AATGAATATT TAGCCGTTTC CCAACTTTGG
ATTAAAGGAG AAAGAAACTA TCGCCGTCCT GATATCTTGC TATATATCAA CGGTTTACCC
CTGGTTTTCA TCGAACTAAA AAACTCCAAC GTCAAACTGC AAACAGCCTT TGACGACAAC
CTCAGCAACT ACAAAAAAGA TATTCCTCAA CTCTTTCTCA CCAACGCCTT CTGCATTCTC
TCCAACGCCC TAGAAACCAA AATTGGCAGT TTTACCGCAG AGTGGGAACA CTTTTTTAAC
TGGTTACGGG TGGAAGATGA AAAGGAAAAA ATAGATCGTC AGCAGATTCA AGAATCTGGT
ACTAGCCTAG AACTAGCTAT AGATGGGCTT TGTCACCCTT CTAAACTACT CGACTACATC
GAAAACTTTA TTCTCTATCA CAAAGAAACT CAGAAAATCA TCGCCCAAAA CCACCAGTTC
ATCGGTGTGA ATCGCGCTAT TGATGCTTTT GACCACCGGG AAGTAAAACA GGGCAAATTA
GGAGTTTTCT GGCACACACA AGGCTCTGGT AAAAGCTTTT CGATGATTTT TTATGCCCGA
AAGATTTTCC GCAAGCTCAC GGGCAATTTT ACCTTTGTGA TTATTACCGA CCGCGATGAT
TTAGATGGGC AGATTTACCG CAATTTCTTA AACACCGAAA CAGTCAAAGA GAAAGAAGCA
GCTCGACCCA AAAACAGTAA ACAACTCCGG GAGTTTCTTT CCCTCAACAA GCGAATAGTT
TTTACCCTGA TTCAAAAGTT CCGCTACGAG AAGGGGGAGG AGTATCCCAT ATTGAGCGAT
CGCAATGACA TTATTGTAAT TGTCGATGAA GCACACCGCA CCCAGTATAA AACCTTAGCG
GAAAATATGC GGGCAGGTTT ACCCAATGCC AATTATCTCG CCTTCACAGG TACTCCCCTA
TTAGGTAAAA AGCGCAAGAC TAACGAATGG TTTGGGGATT ATGTCAGCGA GTATAATTTT
TCTCAGTCAG TAGATGATGG CGCAACCGTC CCCCTGTTTT ACCAGAAGCG AGTACCAGAG
GTACTGATTC AAAACGAGGA TTTAAGCGAG GAATTTTACC AAATCCTTGA AGATGAAAAC
CTTGATGACA CCCAGCAAGC CAAGCTAGAA AGGAAATTTG CCCGCGAAAC CGAAGTTATT
AAACGTGATG ATAGGTTAGA GACAATAGCA AAAGATATTG TTTACCACTT TCCTCGTCGT
GGCTATCTTG GTAAAGGCTT GGTGGTTTCT CTTGATAAAT TCACTGCGGT GAAAATGTAC
GACAAAGTAC AGTACCACTG GAAAGCAGAA ATTAAAAATC TGGTGGGACG AATTAAGAAA
TCTAGTAATG ATATTGAGAA AATTCGCCTG AAAAAGATTC TCGACTTTAT GCGAGAAACA
GAAATGGCTG TGGTGATTAG TCTGGAAGGA AGCGACAGCG AAGTTGAAAA ATTTGCGAAA
CAAGGACTTG ATATCAAACC CCACCGCGAA AGAATGAATA CCGTTGATGC ACAGGGACAT
GATATCGAGT ATCAATTTAA AGATGAGGAA AATTCCCTGC GGTTGGTGTT TGTCTGCGCC
ATGTGGCTGA CGGGTTTCGA TGCGCCCACT CTCTCGACGC TTTACCTTGA CAAGCCCATG
AAAGACCACA CGCTGATGCA AACTATCGCC CGTGCGAACC GCATCACCTC ATATCAGATT
CACAACGTCA CCAAAACCAA CGGCGAAATT ATTGATTATT ACAACGTCTT CCGCAACATG
AAGCAAGCCT TGGCAGATTA CGCTTTGGGC GATGATGGAG ACAAAGAAGA CTCACCCATT
CAAGAAAAGT CTAACCTGTT TGAGTTACTT GATGATGCGA TCGCCCAAGG ACTGAATTTT
TGTCGGGAAC TGGGGATTGA TTTAGAAAGC ATCCTCAAAA TTCAGGAAAC TTTCGAGAAA
TTAGGACAAT TTAACCAGTT TGCTGACACT CTACTGCAAA AAGATGAATG GCGTAAAGGA
TTTTTTGTCT ATCAAAATAC TATTACCTCA CTCTATGAAG CCTGTAAACC AGAGATAATC
AAGAAACCAC GTCCATTGGT TTTCATCTTC CAATATCTGC GAGGGGTGAT TGATAGCATC
ATTGAACGCA AAGATATTGA CCATGCAAGT TTAAAAATTG CGGAACTGCT AGATGAAAGC
GTAGTTGCTG ATAATCAAGG AATTACCACT AAAGAATATA GTGCAGAATA CAACATTATC
CAGACAGGTC AAGTCTGGGA TTTAAGCAAA ATTAACTTCG AGCAACTCAA AGCCGAATTT
AGCTACAAAC CCTATAAAAA TATTGAAATT ACCGATTTAC GCAGTTTTAT TGAAGACAAG
CTGAACAAAA TGATTCAGCA AAACACCACC CGCACCGACT TCGCCCAAAG GCTACAAGCC
ATCATTGACA AATACAACGC TGGTGGTTCA TCAATTGATA ACTACTACGA AGCTCTGGTT
AATTTTACAG AAAACCTGAA AGCAGAAGCA GAAAGACATA TTCGGGAAGG CTTAACAGAA
GACGAATTGG AACTATTTGA TTTACTCAAG AAAGACAAAA TGACTGCGGA GGAAACTCAA
AAAGTCAAAT TAGCTGCACA GTCTCTCTTG CATCGACTCA TAGCAGAACA ACCAAAAGTC
TTAGTGCAAG ATTGGTATAA AGATATGCAG AGCCAGCGCA GAGTGCAATC CGTAGTTGAA
GCAGTTTTAG ATCAAAACCT GCCTGAAAGC TATGATAGAG CGTTATTTAA GTTTAAATGC
GATACAGTAT TTGATTTGAT GTATGACCAC GCTAGCAAAG GTTCAAAGTG GGCGGGATAA
 
Protein sequence
MPNFISEDDI EQAVLAKLKQ QGFKLLNCFT TNPDDLNDRS NRTDKSEVIF SDRLKAAAIR 
LNPNLPAEAI AQGLETLTNK RQLMSPIAAN REIDRLIRDG IPIEYENAQG RSEHGKVRLI
DFNDLSHNGN NEYLAVSQLW IKGERNYRRP DILLYINGLP LVFIELKNSN VKLQTAFDDN
LSNYKKDIPQ LFLTNAFCIL SNALETKIGS FTAEWEHFFN WLRVEDEKEK IDRQQIQESG
TSLELAIDGL CHPSKLLDYI ENFILYHKET QKIIAQNHQF IGVNRAIDAF DHREVKQGKL
GVFWHTQGSG KSFSMIFYAR KIFRKLTGNF TFVIITDRDD LDGQIYRNFL NTETVKEKEA
ARPKNSKQLR EFLSLNKRIV FTLIQKFRYE KGEEYPILSD RNDIIVIVDE AHRTQYKTLA
ENMRAGLPNA NYLAFTGTPL LGKKRKTNEW FGDYVSEYNF SQSVDDGATV PLFYQKRVPE
VLIQNEDLSE EFYQILEDEN LDDTQQAKLE RKFARETEVI KRDDRLETIA KDIVYHFPRR
GYLGKGLVVS LDKFTAVKMY DKVQYHWKAE IKNLVGRIKK SSNDIEKIRL KKILDFMRET
EMAVVISLEG SDSEVEKFAK QGLDIKPHRE RMNTVDAQGH DIEYQFKDEE NSLRLVFVCA
MWLTGFDAPT LSTLYLDKPM KDHTLMQTIA RANRITSYQI HNVTKTNGEI IDYYNVFRNM
KQALADYALG DDGDKEDSPI QEKSNLFELL DDAIAQGLNF CRELGIDLES ILKIQETFEK
LGQFNQFADT LLQKDEWRKG FFVYQNTITS LYEACKPEII KKPRPLVFIF QYLRGVIDSI
IERKDIDHAS LKIAELLDES VVADNQGITT KEYSAEYNII QTGQVWDLSK INFEQLKAEF
SYKPYKNIEI TDLRSFIEDK LNKMIQQNTT RTDFAQRLQA IIDKYNAGGS SIDNYYEALV
NFTENLKAEA ERHIREGLTE DELELFDLLK KDKMTAEETQ KVKLAAQSLL HRLIAEQPKV
LVQDWYKDMQ SQRRVQSVVE AVLDQNLPES YDRALFKFKC DTVFDLMYDH ASKGSKWAG