Gene Achl_1093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_1093 
Symbol 
ID7292538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp1202179 
End bp1205403 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content64% 
IMG OID643589500 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_002487175 
Protein GI220911866 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCCG GTCCAATAGG GGGATCCACC GTGACAACAG CACCAGCAGT AACCTTTTCA 
GAAGCCGAGT GGGAAGGCAT GGCCCTTGAG CTCCTGGCCG AGCCGCTCGG CTGGCGGCCC
ATGTCCGGGC AGGCCATCGC CCCCGGCACC GGCCAGCGGG ACACCTGGGA CGAGCTGCTC
ATCCGACCCC GGCTGCTCAC AGCCCTGCAG AAATTCAACC CCAAGGTTCC CGCCGAGTAC
CTGCAGCAGG CCCTTGCCGA GATTGCCTCG CCCAAATCCA ACGACCCCAT CACCGAGAAC
CACCGCATCC ACAACTACCT GGTGGACGGC TACCGGCTCA GCTACATCGA CTCCGACGGC
AACGAAGCCA ACCCCACCAT CCACCTGCTG AGCCAGGACC CCGACCAGAA CGACTGGCTC
GCCGTCAACC AGGTCACTCT GATCCAGGGC GACTACAACC GCCGTTTCGA CATTGTCCTG
TACTGCAACG GCATGCCGGT GAGCATCATC GAACTGAAGA AGGCCGGCAG CGCCACCGCT
GACGTTGCCT CAGCGCACGC CCAGCTGCAG ACGTACCTGC GGGAATTCCC CATGGGGTTC
CGCTTCTGTG TCTTTACGCT CGCCTCAGAC GGCATCCAGG CCAAGTACGG CACCCCGTTC
ACTCCCCTCA ACCACTTCTC GCCCTGGAAC GTGGACGACG ACGGTCTGCC GGTGGCTCCC
GGCTACATGG AGGACGGGGT CGCCATCACA GCGTTGGAGA CCGCCCTCAA CGGCCTGTAC
AACCAGGAGC GCTTCCTGCA GCTGACCCGC AGCTTCACGG CGTTCGACGA GGGATCCGAC
GGGCTCGTGA AACGCATTGC CAAGCCCCAC CAGTACTTCG CCGTGACCAA GGCCGTAGGC
AGCACTGTCC AAGCGGTGGA AAGCAACGGC AAGGCCGGCG TTGTATGGCA CACCCAGGGC
TCCGGCAAGT CAATGGAGAT GGAGCTTTAC GCCAACCTGG TGGCCCGGCA CCCCAAGCTG
AAGAACCCGA CAGTTGTCGT GATCACGGAC CGCAACGAAC TCGACGGCCA GCTGTTCGAG
GGCTTTGACC GGAGTCTTCT CCTGGCCGAA TCGCCCAGGC AGATCCGGAA ACGCTCCGAG
CTGCGGGAGG AACTGAGCAA CCGCACGACC GGCGGCATCT ACTTCACCAC CCTGCAGAAG
TTCGGCCGCA GCAAGGCCGA GAAAGACGCC GGCACCGAGC ACCCGCTGCT GTCGGACCGC
CGCAACATCA TCGTGGTCGT GGATGAGGCC CATCGCTCGC ACTATGACGA CCTGAACGGC
TACGCCCGTC ACCTGCGCGA CGCGCTGCCC CACGCGACCC TGATCGCGTT TACCGGCACG
CCGCTTTCAT TCGCCGACCG CAATACCCGG GAAGTTTTCG GCGATGACAT CGACGTGTAC
GACCTCACCC GAGCGGTCAC AGATGGAGCC ACGGTGCCGG TGTACTTCGA GCCCCGGCTG
ATCAAGGTGG GTCTGGCGTC CGAGGTCACC GAGGAGATCC TTGACCAGGC GGCCGACGAG
GCCACCTTGG GCCTGGACGA CACGGAGCGG GCCCGCCTCG AGGCGAGCGT CGCGGTGGTG
AACGCCGTCT ACGGCGCCCC GCAGCGCATC GCAGCACTGG CAGAGGATCT AGTGGCGCAC
TGGGAGGGCC GGCGCGCCCA GATGGGCAAG TTTATCGAAT CGCCGGGCAA GGCGATGATC
GTAGGCGGGA CGCGGGAAAT TTGCGCAAAG CTGTACACGG CGATCGTGGA GCTGCGGCCC
GACTGGCATT CCGATGACCT GGCCAAGGGC AAGATCAAAG TCGTCTACTC CGGTGACGCC
ACCGATGTTC CGCCGGTATC CGACCATGTG CGGCGCGACT CCGCCAATGC GACCATCAAG
GAACGTCTGA AGGACGTCGA CGATGAGCTG GAGCTAGTGA TCGTCAAAGA CATGATGCTG
ACGGGCTACG ACTCCCCGCC GCTACACACG CTGTACCTGG ACCGGCCGCT GAAGGGCGCG
CTGCTGATGC AGACCCTGGC CCGGGTGAAC CGCACCTTCC GCGGCAAGAA GGACGGGCTG
CTGGTGGCCT ATGCCCCCCT GGCGGAGAAC CTGGCGCAGG CCCTGAGCGA GTACACGAAG
GATGACCAAG CGAACAAGCC CGTCGGCCGG AACGTAGATG AGGCCATTGG CCTGACGGTG
ACGCTGGTGG AAACGCTGCG CGGCCTGCTC GCCGGGTACG ACTGGAAAGC GGTGCTGATG
CGGGGCGGCC CCAAAGCCTT TATCAGCGCT GCCACCGGCG CGGCCAACTA CCTGCGAAGC
CCCGAAACAC CCGGTAACCG TCCGGCTGAG GGCGAGGAGA CGCTGGCCTC CAAATACCGC
CGGCACTCGG GCCAGTTGTC GCGGGCGTGG GCGCTGTGTT CCGGTTCGGA GACGCTGGCC
GAGCTGCGGC CGGAGATCCA GGTCTACGAG GAAACCCGGG TGTACATGGC CAAGTTCGAC
GCGGCAGACC GCCAGGCCAG CGGCGAGCCC GTGCCCGAGG AGATTCAGCG GCTGCTCGGC
AATCTGATCG CATCAGCTAC ATCGTCGGGC GAGGTACTGG ACATCTACGA GGCAGCTGGC
ATGCCAAAGC CCTCACTGGA TGACCTGACG CCGGAGTTCA TCGCCAAGAC CCAGAAGGCA
CGCAACCCCC AGCTGGCTAT TGAGGCGCTG CGGAAGCTCA TCTCTGATGA GTCGGCTGTG
GCAACACGAA ACAACGTGAT CCGCCAGCGG GCGTTCTCGG AGCGCATCAC CGAGTTGATG
AAGAAGTACA CCAACCAGCA GTTGACGTCT GCTGAGGTTA TCGCCGAGCT GGTGGAGCTG
GCCCGTGAGG TGGCTGCCGA AGGGAACCGT GGAGGGCACT TCACTCCCCC GCTGAACTCT
GACGAGCTGG CCTTCTATGA TGCCGTGGCC TCCAATGAGT CCGCCGTGGA GGTCCAAGGC
GAGGGCGTAC TCGCCGACAT CGCCCGCGAA CTGGTCTCCG TCATGCGCCG CGACATCCGG
ACCGACTGGA CAGTGCGCGA CGACGTCAGG GCCAAGCTCC GCTCCTCCAT CAAGAGGCTC
CTCGTCCGGT TTGGGTACCC GCCGGACAAG CAGCCCGAGG CGATCAAGCT GGTCATGGAG
CAAATGGAGT CCATGGCACC GCGGTTCGCG GAGGCTCGAC TGTGA
 
Protein sequence
MTAGPIGGST VTTAPAVTFS EAEWEGMALE LLAEPLGWRP MSGQAIAPGT GQRDTWDELL 
IRPRLLTALQ KFNPKVPAEY LQQALAEIAS PKSNDPITEN HRIHNYLVDG YRLSYIDSDG
NEANPTIHLL SQDPDQNDWL AVNQVTLIQG DYNRRFDIVL YCNGMPVSII ELKKAGSATA
DVASAHAQLQ TYLREFPMGF RFCVFTLASD GIQAKYGTPF TPLNHFSPWN VDDDGLPVAP
GYMEDGVAIT ALETALNGLY NQERFLQLTR SFTAFDEGSD GLVKRIAKPH QYFAVTKAVG
STVQAVESNG KAGVVWHTQG SGKSMEMELY ANLVARHPKL KNPTVVVITD RNELDGQLFE
GFDRSLLLAE SPRQIRKRSE LREELSNRTT GGIYFTTLQK FGRSKAEKDA GTEHPLLSDR
RNIIVVVDEA HRSHYDDLNG YARHLRDALP HATLIAFTGT PLSFADRNTR EVFGDDIDVY
DLTRAVTDGA TVPVYFEPRL IKVGLASEVT EEILDQAADE ATLGLDDTER ARLEASVAVV
NAVYGAPQRI AALAEDLVAH WEGRRAQMGK FIESPGKAMI VGGTREICAK LYTAIVELRP
DWHSDDLAKG KIKVVYSGDA TDVPPVSDHV RRDSANATIK ERLKDVDDEL ELVIVKDMML
TGYDSPPLHT LYLDRPLKGA LLMQTLARVN RTFRGKKDGL LVAYAPLAEN LAQALSEYTK
DDQANKPVGR NVDEAIGLTV TLVETLRGLL AGYDWKAVLM RGGPKAFISA ATGAANYLRS
PETPGNRPAE GEETLASKYR RHSGQLSRAW ALCSGSETLA ELRPEIQVYE ETRVYMAKFD
AADRQASGEP VPEEIQRLLG NLIASATSSG EVLDIYEAAG MPKPSLDDLT PEFIAKTQKA
RNPQLAIEAL RKLISDESAV ATRNNVIRQR AFSERITELM KKYTNQQLTS AEVIAELVEL
AREVAAEGNR GGHFTPPLNS DELAFYDAVA SNESAVEVQG EGVLADIARE LVSVMRRDIR
TDWTVRDDVR AKLRSSIKRL LVRFGYPPDK QPEAIKLVME QMESMAPRFA EARL