Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_1093 |
Symbol | |
ID | 7292538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 1202179 |
End bp | 1205403 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643589500 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002487175 |
Protein GI | 220911866 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCCG GTCCAATAGG GGGATCCACC GTGACAACAG CACCAGCAGT AACCTTTTCA GAAGCCGAGT GGGAAGGCAT GGCCCTTGAG CTCCTGGCCG AGCCGCTCGG CTGGCGGCCC ATGTCCGGGC AGGCCATCGC CCCCGGCACC GGCCAGCGGG ACACCTGGGA CGAGCTGCTC ATCCGACCCC GGCTGCTCAC AGCCCTGCAG AAATTCAACC CCAAGGTTCC CGCCGAGTAC CTGCAGCAGG CCCTTGCCGA GATTGCCTCG CCCAAATCCA ACGACCCCAT CACCGAGAAC CACCGCATCC ACAACTACCT GGTGGACGGC TACCGGCTCA GCTACATCGA CTCCGACGGC AACGAAGCCA ACCCCACCAT CCACCTGCTG AGCCAGGACC CCGACCAGAA CGACTGGCTC GCCGTCAACC AGGTCACTCT GATCCAGGGC GACTACAACC GCCGTTTCGA CATTGTCCTG TACTGCAACG GCATGCCGGT GAGCATCATC GAACTGAAGA AGGCCGGCAG CGCCACCGCT GACGTTGCCT CAGCGCACGC CCAGCTGCAG ACGTACCTGC GGGAATTCCC CATGGGGTTC CGCTTCTGTG TCTTTACGCT CGCCTCAGAC GGCATCCAGG CCAAGTACGG CACCCCGTTC ACTCCCCTCA ACCACTTCTC GCCCTGGAAC GTGGACGACG ACGGTCTGCC GGTGGCTCCC GGCTACATGG AGGACGGGGT CGCCATCACA GCGTTGGAGA CCGCCCTCAA CGGCCTGTAC AACCAGGAGC GCTTCCTGCA GCTGACCCGC AGCTTCACGG CGTTCGACGA GGGATCCGAC GGGCTCGTGA AACGCATTGC CAAGCCCCAC CAGTACTTCG CCGTGACCAA GGCCGTAGGC AGCACTGTCC AAGCGGTGGA AAGCAACGGC AAGGCCGGCG TTGTATGGCA CACCCAGGGC TCCGGCAAGT CAATGGAGAT GGAGCTTTAC GCCAACCTGG TGGCCCGGCA CCCCAAGCTG AAGAACCCGA CAGTTGTCGT GATCACGGAC CGCAACGAAC TCGACGGCCA GCTGTTCGAG GGCTTTGACC GGAGTCTTCT CCTGGCCGAA TCGCCCAGGC AGATCCGGAA ACGCTCCGAG CTGCGGGAGG AACTGAGCAA CCGCACGACC GGCGGCATCT ACTTCACCAC CCTGCAGAAG TTCGGCCGCA GCAAGGCCGA GAAAGACGCC GGCACCGAGC ACCCGCTGCT GTCGGACCGC CGCAACATCA TCGTGGTCGT GGATGAGGCC CATCGCTCGC ACTATGACGA CCTGAACGGC TACGCCCGTC ACCTGCGCGA CGCGCTGCCC CACGCGACCC TGATCGCGTT TACCGGCACG CCGCTTTCAT TCGCCGACCG CAATACCCGG GAAGTTTTCG GCGATGACAT CGACGTGTAC GACCTCACCC GAGCGGTCAC AGATGGAGCC ACGGTGCCGG TGTACTTCGA GCCCCGGCTG ATCAAGGTGG GTCTGGCGTC CGAGGTCACC GAGGAGATCC TTGACCAGGC GGCCGACGAG GCCACCTTGG GCCTGGACGA CACGGAGCGG GCCCGCCTCG AGGCGAGCGT CGCGGTGGTG AACGCCGTCT ACGGCGCCCC GCAGCGCATC GCAGCACTGG CAGAGGATCT AGTGGCGCAC TGGGAGGGCC GGCGCGCCCA GATGGGCAAG TTTATCGAAT CGCCGGGCAA GGCGATGATC GTAGGCGGGA CGCGGGAAAT TTGCGCAAAG CTGTACACGG CGATCGTGGA GCTGCGGCCC GACTGGCATT CCGATGACCT GGCCAAGGGC AAGATCAAAG TCGTCTACTC CGGTGACGCC ACCGATGTTC CGCCGGTATC CGACCATGTG CGGCGCGACT CCGCCAATGC GACCATCAAG GAACGTCTGA AGGACGTCGA CGATGAGCTG GAGCTAGTGA TCGTCAAAGA CATGATGCTG ACGGGCTACG ACTCCCCGCC GCTACACACG CTGTACCTGG ACCGGCCGCT GAAGGGCGCG CTGCTGATGC AGACCCTGGC CCGGGTGAAC CGCACCTTCC GCGGCAAGAA GGACGGGCTG CTGGTGGCCT ATGCCCCCCT GGCGGAGAAC CTGGCGCAGG CCCTGAGCGA GTACACGAAG GATGACCAAG CGAACAAGCC CGTCGGCCGG AACGTAGATG AGGCCATTGG CCTGACGGTG ACGCTGGTGG AAACGCTGCG CGGCCTGCTC GCCGGGTACG ACTGGAAAGC GGTGCTGATG CGGGGCGGCC CCAAAGCCTT TATCAGCGCT GCCACCGGCG CGGCCAACTA CCTGCGAAGC CCCGAAACAC CCGGTAACCG TCCGGCTGAG GGCGAGGAGA CGCTGGCCTC CAAATACCGC CGGCACTCGG GCCAGTTGTC GCGGGCGTGG GCGCTGTGTT CCGGTTCGGA GACGCTGGCC GAGCTGCGGC CGGAGATCCA GGTCTACGAG GAAACCCGGG TGTACATGGC CAAGTTCGAC GCGGCAGACC GCCAGGCCAG CGGCGAGCCC GTGCCCGAGG AGATTCAGCG GCTGCTCGGC AATCTGATCG CATCAGCTAC ATCGTCGGGC GAGGTACTGG ACATCTACGA GGCAGCTGGC ATGCCAAAGC CCTCACTGGA TGACCTGACG CCGGAGTTCA TCGCCAAGAC CCAGAAGGCA CGCAACCCCC AGCTGGCTAT TGAGGCGCTG CGGAAGCTCA TCTCTGATGA GTCGGCTGTG GCAACACGAA ACAACGTGAT CCGCCAGCGG GCGTTCTCGG AGCGCATCAC CGAGTTGATG AAGAAGTACA CCAACCAGCA GTTGACGTCT GCTGAGGTTA TCGCCGAGCT GGTGGAGCTG GCCCGTGAGG TGGCTGCCGA AGGGAACCGT GGAGGGCACT TCACTCCCCC GCTGAACTCT GACGAGCTGG CCTTCTATGA TGCCGTGGCC TCCAATGAGT CCGCCGTGGA GGTCCAAGGC GAGGGCGTAC TCGCCGACAT CGCCCGCGAA CTGGTCTCCG TCATGCGCCG CGACATCCGG ACCGACTGGA CAGTGCGCGA CGACGTCAGG GCCAAGCTCC GCTCCTCCAT CAAGAGGCTC CTCGTCCGGT TTGGGTACCC GCCGGACAAG CAGCCCGAGG CGATCAAGCT GGTCATGGAG CAAATGGAGT CCATGGCACC GCGGTTCGCG GAGGCTCGAC TGTGA
|
Protein sequence | MTAGPIGGST VTTAPAVTFS EAEWEGMALE LLAEPLGWRP MSGQAIAPGT GQRDTWDELL IRPRLLTALQ KFNPKVPAEY LQQALAEIAS PKSNDPITEN HRIHNYLVDG YRLSYIDSDG NEANPTIHLL SQDPDQNDWL AVNQVTLIQG DYNRRFDIVL YCNGMPVSII ELKKAGSATA DVASAHAQLQ TYLREFPMGF RFCVFTLASD GIQAKYGTPF TPLNHFSPWN VDDDGLPVAP GYMEDGVAIT ALETALNGLY NQERFLQLTR SFTAFDEGSD GLVKRIAKPH QYFAVTKAVG STVQAVESNG KAGVVWHTQG SGKSMEMELY ANLVARHPKL KNPTVVVITD RNELDGQLFE GFDRSLLLAE SPRQIRKRSE LREELSNRTT GGIYFTTLQK FGRSKAEKDA GTEHPLLSDR RNIIVVVDEA HRSHYDDLNG YARHLRDALP HATLIAFTGT PLSFADRNTR EVFGDDIDVY DLTRAVTDGA TVPVYFEPRL IKVGLASEVT EEILDQAADE ATLGLDDTER ARLEASVAVV NAVYGAPQRI AALAEDLVAH WEGRRAQMGK FIESPGKAMI VGGTREICAK LYTAIVELRP DWHSDDLAKG KIKVVYSGDA TDVPPVSDHV RRDSANATIK ERLKDVDDEL ELVIVKDMML TGYDSPPLHT LYLDRPLKGA LLMQTLARVN RTFRGKKDGL LVAYAPLAEN LAQALSEYTK DDQANKPVGR NVDEAIGLTV TLVETLRGLL AGYDWKAVLM RGGPKAFISA ATGAANYLRS PETPGNRPAE GEETLASKYR RHSGQLSRAW ALCSGSETLA ELRPEIQVYE ETRVYMAKFD AADRQASGEP VPEEIQRLLG NLIASATSSG EVLDIYEAAG MPKPSLDDLT PEFIAKTQKA RNPQLAIEAL RKLISDESAV ATRNNVIRQR AFSERITELM KKYTNQQLTS AEVIAELVEL AREVAAEGNR GGHFTPPLNS DELAFYDAVA SNESAVEVQG EGVLADIARE LVSVMRRDIR TDWTVRDDVR AKLRSSIKRL LVRFGYPPDK QPEAIKLVME QMESMAPRFA EARL
|
| |