Gene Information |
Locus tag | Ava_1156 |
Symbol | |
ID | 3683351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 1412992 |
End bp | 1415916 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637716492 |
Product | Type I site-specific deoxyribonuclease HsdR |
Protein accession | YP_321675 |
Protein GI | 75907379 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
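The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; both are assumptions that the reported values happen to support, and the variable names are illustrative rather than part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1412992
end_bp = 1415916
reported_gene_length = 2925      # bp
reported_protein_length = 974    # aa

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.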
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.159677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAC CAGGGGAACA GAAAACTGTT CAAAGCAGGA TAATTACTTA CGCTCAAGAA ATTGGCTGGA CATTTGTATC TCGTGAAGAA GCGGAAACCA GACGGGGTTT TAATAATCAC AGTGGTAACA TTCAAGAGAA GGCACAACAA GCATCCCTGT TTTTTGACGA TACCCTCTAT CAAAAAGTCC GCCAGTTTAA CCCCAAGTAT ACCGAAGCTC AAGGCGTACT TATTGCGCTA CTGAGTAACC TACCTGCCGA CATCTACGGA AATAGAGACT TCCTAACTTA TCTCCGCAAC CAAGGCAAAT ATTTTTATGC CCCTGAAAAT CGGGAACTCG ATCTAAAACT CATCAACTAC GATGACACCA CCGAAAACTC CTACGAAGTA ACGGAAGAAT ACTACATTCA TAACGGCAAG TACGGTACAC GGGAAGATGT AGTCTTTTTA ATCAACGGTA TTCCTATCCT CGTCATCGAA TGCAAAAACG CTACTAAAGA TGAAGCGATC GCATTAGGTG TTGATCAAAT TCGTCGCTAC CACACCGAAA CACCAGAGTT ATTCGTTCCC CAGATGATTT TTACCGCCAC TGAAGCCATT GGCTTTTCGT ATGGGGTGAC GTGGAATATT GTCAGGCGCA ACATTTTTAA CTGGAAAGCA GAACAAATCG GTCAACTAGA AGCAAAAATT AAGAGTTTTT GTCAACCTAA CTATATTCTC CAATTCCTGC AAAATTATAT TCTCTTTGCA GAGAAAGAAG AAACTCTGCA AAAATTTATT CTGCGTCAAC ACCAAACCAC CGCAGTTGAA AAGGTAATTG AACGCTGTCA CGATCCTGAA CACAGCAAAG GACTTGTCTG GCATACCCAA GGCAGTGGTA AGACTTTCAC AATGATTAAG ACGGCGGAAA TGCTCTTTAA AGCCCCACAG AGCGATAAAC CGACAATTCT GCTGATGATT GACCGCAACG AGCTAGAAGA CCAAATGCTC AGAAACCTAA TTAACTTAGG ACTGAATAAC GTTCAACACG CTGATCGCAT TACCACCCTT AATCAACTAC TAAAAGATGA TTATCGTGGC ATTATCGTCA CCATGATCCA CAAATTCCGC GATATGCCTG CCAATATTAA TCTAAGAAAA AATATTTATA TTCTGATTGA TGAGGCACAC CGCACCACCG GAGGAGATTT AGGAACTTTC TTAATGGCAG GACTCCCTAA TGCCACTTTT ATCGGATTTA CTGGTACACC TGTAGATAAA ACTAGCTACG GTAAAGGCAC ATTTAAAACC TTCGGCACAG ACGATAAAAA GGGGTATCTG CATAAATATT CCATTGCAGA AAGTATTGAA GACGGAACAA CACTACCCCT TTACTACAAT ATTGCTCCCA GCAAGATAAT TGTGCCAAAG GAACTCATGG AAAAGGAGTT TCTCAACTTG GCAGAAACTG AGGGGATTAG CGATATTGCC GAATTAAATA AAATTTTAGA TCGTGCCGTT AATTTAAAAA ACTTCCTCAA AGGTAATCAA AGAGTAGATC AGGTAGCAAA ATATGTTGCC AAGCATTACA CTCAAAACGT AGAACCACTA GGTTATAAAG CTTTTTTAGT CGCTGTTGAC CGTCCTGCCT GTGCTAAATA TAAACAAGCT TTAGATAAAT ATTTACCTCC TGAATACTCC GCAGTTGTCT ATACCGGAAA CAATAACGAT ACTCAAGAAC TCAAAACCTA CCACCTCGAC GACAAGACAG AAAAACAAAT CCGTAAAAAC TTCGCCAAAT TTGGAGAATA CCCCAAAATC 
CTGATTGTTA CCGAGAAACT GTTGACAGGA TTTGATGCTC CCCTGCTTTA TGCCATGTAT CTCGATAAGC CCATGCGAGA TCATACCTTG TTACAGGCGA TCGCTCGTGT GAATCGTCCC TACGAAAGCG AAGCTCAAGA GATGGTGAAA CCGCATGGTT TTGTGCTGGA TTTTGTAGGC ATTTTCAACA AACTAGAAAA AGCACTTGCC TTTGACAGCG ATGAAATCAA CGCTATTATC AAAGATTTGA AACTGCTCAA AACTTTATTC AAAAATAAAA TGGAGCAGCA AGCCTCAACT TATTTGAGCT TGATTCAGAA CAATTTTAAC GATAAAGATG TAGATGATAT TATTGAGCAT TTTCGAGAAC AAGAACGCAG AAAAGCATTT TCTAAAGCCT ATAAAGAACT AGAAATGCTC TATGAAGTTA TTTCCCCTGA TGCTTTCCTA CGTCCCTACA TGGACGATTA CGCCACCCTT TCCTCTATTT ATGAAGTAAT CCGCAAAGCT TACAGCAAAA GGGTTTACGT CGATAAAGCG TTGCAGCGCA AAACTGATGA ACTCGTACAA AAACATATTG GCACAACAGC GATCGCAACT GTAACAGATT TTGTAGAAAT CAATGCTCAA ACCCTCGAAG TCATCAAAGA CAAGCAAGGA GGTGAAACCA CCAAAGTCAT CAACCTCATC AAAAGCATCG AAAAAACTGC CGAAGAAAAT CCCGATGACC CTTTTCTCAT TGCAATGGCG GAACGTGCCA AAATTGTCCA AGAAAAGTTT GAAAATCGCC AGTCCGATAC CCAAGAAGCC TTAGATACGT TGGTTAAAGA AATTGGTGAA AATGAGCAGC GCAAGAAAGA ACAGGCTAAA CGTGGTTTTG ACAGCTTAAC GTTTTTTGTC TTTCAAACCC TACAGGATGC AGGTATTGAT AATCCTGAAG CAGTAAGCAA TCAAATTAAA CAAGCTTTTA TAGAGTATCC CAATTGGCGC ATCAGTGAGG CAGAATTGAG AGAGTTGCGG AAAGAAGTCA CCTTCGCCAT TTTTGCTGAG ATCGATGAAT TGGATCAAGT GACAGCAGTT GTTGATAAAC TTTTTATGTT GCTAAGTCAA GCTTACTCAA GCTAA
|
Protein sequence | MPKPGEQKTV QSRIITYAQE IGWTFVSREE AETRRGFNNH SGNIQEKAQQ ASLFFDDTLY QKVRQFNPKY TEAQGVLIAL LSNLPADIYG NRDFLTYLRN QGKYFYAPEN RELDLKLINY DDTTENSYEV TEEYYIHNGK YGTREDVVFL INGIPILVIE CKNATKDEAI ALGVDQIRRY HTETPELFVP QMIFTATEAI GFSYGVTWNI VRRNIFNWKA EQIGQLEAKI KSFCQPNYIL QFLQNYILFA EKEETLQKFI LRQHQTTAVE KVIERCHDPE HSKGLVWHTQ GSGKTFTMIK TAEMLFKAPQ SDKPTILLMI DRNELEDQML RNLINLGLNN VQHADRITTL NQLLKDDYRG IIVTMIHKFR DMPANINLRK NIYILIDEAH RTTGGDLGTF LMAGLPNATF IGFTGTPVDK TSYGKGTFKT FGTDDKKGYL HKYSIAESIE DGTTLPLYYN IAPSKIIVPK ELMEKEFLNL AETEGISDIA ELNKILDRAV NLKNFLKGNQ RVDQVAKYVA KHYTQNVEPL GYKAFLVAVD RPACAKYKQA LDKYLPPEYS AVVYTGNNND TQELKTYHLD DKTEKQIRKN FAKFGEYPKI LIVTEKLLTG FDAPLLYAMY LDKPMRDHTL LQAIARVNRP YESEAQEMVK PHGFVLDFVG IFNKLEKALA FDSDEINAII KDLKLLKTLF KNKMEQQAST YLSLIQNNFN DKDVDDIIEH FREQERRKAF SKAYKELEML YEVISPDAFL RPYMDDYATL SSIYEVIRKA YSKRVYVDKA LQRKTDELVQ KHIGTTAIAT VTDFVEINAQ TLEVIKDKQG GETTKVINLI KSIEKTAEEN PDDPFLIAMA ERAKIVQEKF ENRQSDTQEA LDTLVKEIGE NEQRKKEQAK RGFDSLTFFV FQTLQDAGID NPEAVSNQIK QAFIEYPNWR ISEAELRELR KEVTFAIFAE IDELDQVTAV VDKLFMLLSQ AYSS
|
| |
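The sequences above are printed in space-separated groups of ten characters, so they need light cleaning before any downstream use. The sketch below shows one possible way to strip the grouping and compute a GC fraction; the helper names `clean_sequence` and `gc_content` are hypothetical, and the example runs only on the first 20 bases of the gene sequence rather than on the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for grouping and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base groups of the gene sequence in this record.
fragment = clean_sequence("ATGCCAAAAC CAGGGGAACA")
print(len(fragment), gc_content(fragment))
```

Applying `gc_content` to the complete cleaned gene sequence would give a value to compare against the reported GC content field.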