Gene Ajs_3606 details

Gene Information

Locus tag: Ajs_3606
Symbol:
ID: 4673627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax sp. JS42
Kingdom: Bacteria
Replicon accession: NC_008782
Strand:
Start bp: 3806730
End bp: 3809732
Gene Length: 3003 bp
Protein Length: 1000 aa
Translation table: 11
GC content: 61%
IMG OID: 639840638
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_987794
Protein GI: 121595898
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
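
The coordinates and lengths above appear mutually consistent. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that contributes no residue (both are assumptions; the record does not state its conventions). The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: codons minus one terminal stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length_bp = gene_length_bp(3806730, 3809732)
print(length_bp)                     # 3003
print(protein_length_aa(length_bp))  # 1000
```

Under these assumptions, 3003 bp corresponds to 1001 codons, of which 1000 encode amino acids.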


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTTAACG AATCCAACAC CGTCGAAGCC TATGTCCGTG ATCTGCTCGC CGGTCCCATC 
AAGGCTGCCC CAGTCAACAC CGCCCAGGAA CCCCAAGCTA GCTACGGGCC CAGCCCCAAA
GGCATCGGCT GGCGCTACGC TGCTCCGTCT GAGGTGCCGC GCCAGATTCA GGAAGTGCTG
GTTGAACCCT GGCTGCGCGA CGCGCTGATT CGTCTAAACC CGGAGATCGC CGCCCAGCCG
GACCGGGCCG ACGAGGTGCT CTACAAGCTG CGTGCCATCG TGCTGTCGGT GCGCTCGGAT
GGTCTGATTC GCGCCAACGA GGAAATGACC GCGTGGATGC GCGGTGAGCG CTCGATGCCC
TTCGGCCACA ACAACGAGCA TGTGCCGGTG CGGCTGATCG ACCTGGATGA TCTGTCGCAA
AACCACTACA TCGTCACCCA GCAATACACC TACCGTGCAG GCCCCACCGA ACGCCGGGCT
GATCTGGTGC TGTTGGTGAA TGGCCTGCCG CTGGTACTGA TCGAGGCCAA GACCCCGGTC
AAAAAATGCA TCAGCTGGGT CGACGCGGCG GTGCAGGTGC ACGATGACTA CGAGAAGTTC
GTGCCTGAGC TCTTCGTCTG CAACGTGTTC TCGGTGGCGA CCGAGGGCAA GGCGTACCGC
TATGGCTCCA TCGGGCTACC AGTCAAGGAC TGGGGCCCTT GGCATCTGGA TGGTGATGGC
GAGGATAGTC AGCACCATCC GCTGAAGTCG CTCAAGCTGT CGGCCGAGAG CATGCTGCGC
CCGCATGTGG TGCTGGACAT CCTCGGCAGT TTCACCCTGT TCGCCACCAA CAAGAAAAAG
CAGCGCATCA AGATCATTTG CCGGTACCAG CAGTTTGAAG CGGCCAACAA GATCGTCGAG
CGGGTGCTGG CGGGCTACCC CAGGAAGGGC TTGATCTGGC ATTTCCAGGG CTCGGGCAAG
TCGCTGCTGA TGGTGTTTGC CGCGCAGAAG CTGCGCATGC ACGCAGGATT GAAGAATCCC
ACCGTGCTGA TCGTGGTGGA TCGGATCGAT CTTGATAGCC AGATCACCGG TACCTTCACC
GGAGCGGACA TTCCCAACCT GGAAAAGGCG GATACCCGCG AGAAGCTGCA GCAGCTGCTG
GCGCAGGACG TGCGCAAGAT CATCATCACC ACGATCTTCA AGTTCGGCGA GGGCCCCAAC
TCAACAAAAG GGGGCAGCCT GAACGACCGC AGCAACATCA TCGCCTTGGT GGACGAAGCC
CACCGCACGC AAGAAGGCGA CCTGGGCCGC AAGATGCGCG AAGCCCTGCC CAACGCGTTT
CTGTTCGGCC TGACCGGCAC GCCGATCAAC CGTGCCGACC GCAACACCTT CTACGCCTTT
GGTGCAGACG AGGATGAGAA AGGCTACATG AGCCGGTACG GCTTCGAGGA GTCGATCCGC
GACGGTGCCA CGCTAAAACT GCACTTCGAA CCGCGCTTGA TTGATCTGCA CATCGACAAG
GCCGCGCTGG ACGCCGCCTA CAAAGACCTG ACCGGCGGCC TGTCGGATCT CGACAAGGAC
AACCTCGCTA AAACCGCCGC CAAGATGGCC GTGCTGGTGA AGACGCCTGA GCGCATCCGC
AAGGTGTGCG AAGACATCGT CGAGCACTTT CAGGCCAAGG TGGAGCCCAA TGGCTTCAAG
GGCCAGATCG TAACGTTCGA TCGCGAGTCC TGCCTGCTGT TCAAGGCCGA GCTGGACAAG
CTGCTGCCGC CCGAGGCTAC AGACATCGTG ATGTCGGTGC AGGCGGCGGA CAAAAAAGAA
CACCCTGAGT ACGCGCCTTA CGACCGAAGC CGCGATGAAG AAGAGCGACT GCTCGATCGC
TTCCGCGACC CGGCCGACCC GCTGAAGCTG ATCATCGTCA CCGCTAAGCT GCTGACCGGC
TTCGACGCGC CCATCCTGCA GGCCATGTAC CTGGACAAGC CGCTGCGCGA CCACACGCTG
CTCCAGGCCA TCTGCCGGGT GAACCGTACC TACTCCGAGC AGAAGACCCA CGGCCTGATC
GTGGACTACC TCGGCATCTT CGATGACGTC GCGGCGGCGC TGGAATTCGA CGACCAGAGC
GTCAAGCAGG TGGTCAGCAA CATCCAGGAG TTGAAGGACA AGCTGCCCGA AGCCATGCAG
AAGTGCCTGG CCTTCTTCTC TGGCTGCGAT CGCAGTTTGC AAGGCTACGA GGGCCTGATC
GCCGCGCAGC AGTGTCTGCC CAATAACGAG GTGCGAGACA ACTTTGCTGC CGAGTACAGC
GTGCTCAACA AGATCTGGGA GGCGCTGTCA CCGGACACCG TTCTGGGCCC CTTCGAGAAG
GACTACAAGT GGTTGTCGCA GGTGTACCAG TCGGTACAGC CCTCTAGTGG CCACGGCAAG
TTGATCTGGC ATTCGCTGGG CGCCAAGACC ATCGAGCTGA TCCACCAGAA CGTGCATGTC
GACGCGGTGC GGGATGACCT CGACACCTTG GTGCTGGACG CTGATCTGCT GGAAGCGGTG
CTGTCGAACC CAGACCCGAA GAAGGCCAAG GAGATTGAGA TCAAGCTCAA GCGCCGGCTG
CGCGGGCATG GCGGCAACCC CAAGTTCAAG AAGCTGTCGG AGCGGCTCGA TGCGCTGAAG
GACCGCTTCG AATCCGGGCA GATCAACAGC GTCGAGTTTC TGAAGCAGTT GCTGGAGATC
GCCAAGGAGA CGCTGCAGGC CGAGAAGGAC GTGCCGTCCG AAGAGGACGA GGATCGCGGT
AAGGCGGCTC TCACCGAGTT GTTCAACGAG GTCAAGACTT CTGAGACGCC CATCATGGTC
GAACGCGTGG TCACGGACAT CGACGAGATA GTGCGACTGG TCCGCTTCCC GGGCTGGCAA
GGTACGCAGG CCGGCGAGCG TGAAGTCAAG AAGGCCCTGC GCAAAGCCCT CTTCAAATAC
AAGCTGCATG CGGATGAAGA GCTGTTCGAG AAGGCCTACA GCTACATCCG GCAGTATTAC
TGA
 
Protein sequence
MFNESNTVEA YVRDLLAGPI KAAPVNTAQE PQASYGPSPK GIGWRYAAPS EVPRQIQEVL 
VEPWLRDALI RLNPEIAAQP DRADEVLYKL RAIVLSVRSD GLIRANEEMT AWMRGERSMP
FGHNNEHVPV RLIDLDDLSQ NHYIVTQQYT YRAGPTERRA DLVLLVNGLP LVLIEAKTPV
KKCISWVDAA VQVHDDYEKF VPELFVCNVF SVATEGKAYR YGSIGLPVKD WGPWHLDGDG
EDSQHHPLKS LKLSAESMLR PHVVLDILGS FTLFATNKKK QRIKIICRYQ QFEAANKIVE
RVLAGYPRKG LIWHFQGSGK SLLMVFAAQK LRMHAGLKNP TVLIVVDRID LDSQITGTFT
GADIPNLEKA DTREKLQQLL AQDVRKIIIT TIFKFGEGPN STKGGSLNDR SNIIALVDEA
HRTQEGDLGR KMREALPNAF LFGLTGTPIN RADRNTFYAF GADEDEKGYM SRYGFEESIR
DGATLKLHFE PRLIDLHIDK AALDAAYKDL TGGLSDLDKD NLAKTAAKMA VLVKTPERIR
KVCEDIVEHF QAKVEPNGFK GQIVTFDRES CLLFKAELDK LLPPEATDIV MSVQAADKKE
HPEYAPYDRS RDEEERLLDR FRDPADPLKL IIVTAKLLTG FDAPILQAMY LDKPLRDHTL
LQAICRVNRT YSEQKTHGLI VDYLGIFDDV AAALEFDDQS VKQVVSNIQE LKDKLPEAMQ
KCLAFFSGCD RSLQGYEGLI AAQQCLPNNE VRDNFAAEYS VLNKIWEALS PDTVLGPFEK
DYKWLSQVYQ SVQPSSGHGK LIWHSLGAKT IELIHQNVHV DAVRDDLDTL VLDADLLEAV
LSNPDPKKAK EIEIKLKRRL RGHGGNPKFK KLSERLDALK DRFESGQINS VEFLKQLLEI
AKETLQAEKD VPSEEDEDRG KAALTELFNE VKTSETPIMV ERVVTDIDEI VRLVRFPGWQ
GTQAGEREVK KALRKALFKY KLHADEELFE KAYSYIRQYY
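
As an illustration of how the gene sequence relates to the protein sequence, here is a minimal sketch that translates the first 60 nucleotides of the gene. It assumes the standard amino acid assignments shared by translation tables 1 and 11, and it assumes that the first codon (GTG here) is a valid initiator read as methionine; the sketch does not check the initiator against a list of start codons. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Amino acid assignments in TCAG order (first, second, third base).
# Assumption: these are the assignments shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon.

    Assumption: the first codon is an initiator and is always read as 'M'.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = "M" if i == 0 else CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_60_nt = "GTGTTTAACG AATCCAACAC CGTCGAAGCC TATGTCCGTG ATCTGCTCGC CGGTCCCATC"
print(translate_cds(first_60_nt))  # MFNESNTVEAYVRDLLAGPI
```

The result matches the first 20 residues of the protein sequence listed above.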