Gene Arth_0986 details

Gene Information

Locus tag: Arth_0986
Symbol:
ID: 4446163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1063872
End bp: 1066994
Gene Length: 3123 bp
Protein Length: 1040 aa
Translation table: 11
GC content: 61%
IMG OID: 639688792
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_830483
Protein GI: 116669550
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
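
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes one terminal stop codon; both are assumptions, although they are consistent with the values in this record. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1063872
end_bp = 1066994
gene_length_bp = 3123
protein_length_aa = 1040


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming a single terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


# Compare the derived values with the fields stated in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = coding_codons(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Both checks agree with the record: the coordinate span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.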


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGACC TGAACGAGTC AACGGTAGAG CTGGCGGTGC TTCAGTATCT GCGCGAACTC 
GGCTACACGA CGGCCTTCGG CCCAGACATC GCGCCCGAAG CGGCTGCGGC TGAGCGTTCG
TCTTACGAGC AGGTATACCT GTTGAGCCGC TTGCGCGCGG CGGCGGTTCA GATCAACCGT
GGCCTTGATG CGGCTCTCAT TGATGAGGCG ATCAAGCGCC TCGGCCGGGC GGAATCACAG
AACCCAGTTG CGGAGAACCT TCGCGTCCAC GAACTCCTGA CCGAGGGCGT TCCGGTCGAA
CACCGCGACG CCGAAGGCTC TGTACGGACG ACTCGTGTCC GCCTCATCGA CTTCGAGGAC
CCGGCAGGCA ACGACTGGCT CGCCGTCAAC CAGTTCACAA TCATCGAGAA CGGCAAGAAC
CGACGTCCGG ACGTCGTGCT CTTCTTGAAC GGCATGCCGC TTGGCCTGCT GGAGCTGAAG
AACCTGGCCG ACGAGCACGC GACGCTCAAG GGAGCCTGGA ACCAAATCCA GACCTACCGC
CATGACATTC CATCCCTGTT CACTCCGAAC GCGGTGACGG TCGTCAGCGA TGGCGTCAGC
GCCGCCATGT CGTCATTCAC GGGCGGCTTC GAGCACTATG CGCCGTGGAA AACGATCGAC
GGACGCGAGG TCGTGATGAA TCTGCCCGCA GTCGAGGTGT TGATCAAGGG CGTCTTCGAC
CAGAAGCGCT TCCTCGACAT TCTGCAGAAC TTCATCGTCT TCAGCGACGA GTCCAAGGGG
CTCGTGAAGC GCGTCGCGAA ATACCACCAG TACTGGGCCG TCAATGCCGC GGTCGAGTCG
ACCATCGAGG CAGCAGGTCC CGATGGCGAC CGCCGCGGCG GCGTCGTGTG GCACACCCAA
GGGAGTGGCA AGTCGATCGA GATGTTGCTC TACGCGGCGA AGATCATGCG CGACATTCGG
ATGGGCAACC CGACGTTGTT GTTCATCACC GACCGCAACG ACCTCGACGA CCAGCTCTTC
GGCGAGGTGT TTGCACCAGC CGAGATCCTG CCTGAGAAGC CCGTCCAAGC TGACTCCAGA
GCAGACCTTC GCAGCCTGCT CCGCCGCGCG TCCGGCGGCA TCATCTTCAC CACCGTGCAG
AAGTTCGCCC CCGAGGCTGG TGGCGACACC AACCCGGTAC TGACCGACCG CCGCAACGTC
GTCGTTGTCG CCGACGAGGC TCACCGATCC CAATACGGCT TCACCGAGTC CCTTGACGAG
CGCACCGGGC AGTTGAAGTC TGGGCTCGCG AAGCACATGC GCGACGCCCT CCCGAACGCC
ACCTATCTCG GGTTCACGGG CACACCCATC GAGTCGAACG ACAAGTCAAC TCGCTCCGTG
TTCGGTGACT ACATCGACAT CTATGACCTC ACGCGCGCTG TTGAGGACGG TGCCACCGTC
CGGATTTTCT ACGAGTCCCG ACTCGCGAAG GTGTCCCTCG ATGCTGATGT GCACGCTGCA
ATCGACGAAC TCGCCGACGA AATAACCGAG ACCGCAGAAG AGGACGAGGC CACCCGCGCC
AAGTCCAGGT GGGCACGGTT GGAAGCCGTC GTGGGCGCGA ATGACCGTCT CGATGTGATT
GCAGGCGACA TCGTCGACCA CTGGGAGAAA CGCCGGACCG AGATGTTCGG CAAGGCCATG
ATTGTCACGA TGTCGCGGCG CATCGCCGTC GACCTCTACG ACAAGATCGT TAAGCTCAAG
CCGGAGTGGC ACACCGACGA CCCGACGACA GGCATGATCA AGGTCGTCAT GACCGGTTCG
GCGGCCGACC CTCAGGCCTT CCAGCCGCAC ATCTACGACA AGAAGACCCG CAAAGACCTT
AAACTGCGGG CAAAGGACCC AAACGATTCC CTCGAGATCG TCATCGTCCG CGACATGTGG
CTTACCGGCT TCGACGCCCC GTCGATGCAC ACTATGTACG TCGACAAACC AATGCAGGGC
GCCGGCCTGA TGCAAGCCAT CGCACGGGTG AACCGTACCT TCCGCGACAA GCCCGGCGGC
CTGATCGTCG ACTACATCGG CGTCGCCACA AATCTGCGCA GGGCCCTAGC CGAGTACTCC
CCCAGCGATC GTGACCAGGC CGGCGTGCCG ATTGAGGAGA TCGTCTCCGC CATGTTGGAA
AAGCACGACA TCGTGCGAGG ACTCCTTCAC GGTTGCAGGT ACAATTCCTC GCCGCTACTG
GCGCCCGCCG CACGCCTAGC CCAGCACGCG CTCGTTCTCG ACTTCGTTAT GGCCGACCCG
GACCGCACCG CACGTTACCT CGACCAAGTG CTCGCGCTAG CCAAGGCCTT TGCGCTCTGC
GGGGCGCGGG ATGAAGCAGC TGCGATCCGT AATGACGTGC GGATGTTCGC CGATGTCCGA
GCGGCGACCC TGAAGATCCA GAATCCGGAC TCAGGGCGTG CCGGCAGCGG TGCCGTAGAA
ATAGACACCG CGATCGGGCA ACTCGTCAAC GAAGCCGTCA CCGCCGACGA GGTCGTTGAT
ATCTACAAGC TCGCCGGCAT TGAAACTCCA GAGCTGTCGA TCCTGTCGGA CGAGTTCCTC
GACACCCTGG CCGGGAAGGA GAAGCCCAAC CTCCAGATGG GGCTCCTCCG CCGGCTGATC
AACGATCAAA TCCGCACCGT CCAGCGCACC AATATCGTTC AGGCACGAAA ATTCTCCGAG
CAGCTCGACG AGGCAATTAA CCGCTATACG AACCGCACAC TGACGACAGC AGAAATCATT
GCCGAGCTCG TCAAGCTCGC CAAAGACATG CGAAACCAGA ACGACCGTCA CAACAGACTC
GGTCTTTCTG TCGCCGAGGC TGCATTTTAC GATGCCATCG TGCAAAACGA CGTCGCTGTC
CTCCAGATGG GCGACGACAC GCTAAAGAAG ATTGCCGTCA ATCTCGTTTC CACCGTCCAG
CGGAGCGCCA CAATCGACTG GTCCCTCAAA CATTCGGTCC GGGCCGCCAT GAGATCCAAA
ATCCGTCGGC TACTTGCAAG GTACGACTAC CCGCCCGATC ACGAGGAGAA GGCGATTGAG
CTGATACTCC GACAAGCTGA GCTAATCGCC GGGACCGAAG CGCAGTCAAC GGTACGCACT
TGA
 
Protein sequence
MSDLNESTVE LAVLQYLREL GYTTAFGPDI APEAAAAERS SYEQVYLLSR LRAAAVQINR 
GLDAALIDEA IKRLGRAESQ NPVAENLRVH ELLTEGVPVE HRDAEGSVRT TRVRLIDFED
PAGNDWLAVN QFTIIENGKN RRPDVVLFLN GMPLGLLELK NLADEHATLK GAWNQIQTYR
HDIPSLFTPN AVTVVSDGVS AAMSSFTGGF EHYAPWKTID GREVVMNLPA VEVLIKGVFD
QKRFLDILQN FIVFSDESKG LVKRVAKYHQ YWAVNAAVES TIEAAGPDGD RRGGVVWHTQ
GSGKSIEMLL YAAKIMRDIR MGNPTLLFIT DRNDLDDQLF GEVFAPAEIL PEKPVQADSR
ADLRSLLRRA SGGIIFTTVQ KFAPEAGGDT NPVLTDRRNV VVVADEAHRS QYGFTESLDE
RTGQLKSGLA KHMRDALPNA TYLGFTGTPI ESNDKSTRSV FGDYIDIYDL TRAVEDGATV
RIFYESRLAK VSLDADVHAA IDELADEITE TAEEDEATRA KSRWARLEAV VGANDRLDVI
AGDIVDHWEK RRTEMFGKAM IVTMSRRIAV DLYDKIVKLK PEWHTDDPTT GMIKVVMTGS
AADPQAFQPH IYDKKTRKDL KLRAKDPNDS LEIVIVRDMW LTGFDAPSMH TMYVDKPMQG
AGLMQAIARV NRTFRDKPGG LIVDYIGVAT NLRRALAEYS PSDRDQAGVP IEEIVSAMLE
KHDIVRGLLH GCRYNSSPLL APAARLAQHA LVLDFVMADP DRTARYLDQV LALAKAFALC
GARDEAAAIR NDVRMFADVR AATLKIQNPD SGRAGSGAVE IDTAIGQLVN EAVTADEVVD
IYKLAGIETP ELSILSDEFL DTLAGKEKPN LQMGLLRRLI NDQIRTVQRT NIVQARKFSE
QLDEAINRYT NRTLTTAEII AELVKLAKDM RNQNDRHNRL GLSVAEAAFY DAIVQNDVAV
LQMGDDTLKK IAVNLVSTVQ RSATIDWSLK HSVRAAMRSK IRRLLARYDY PPDHEEKAIE
LILRQAELIA GTEAQSTVRT
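
The sequences above are laid out in blocks of ten characters, so the spaces and line breaks have to be stripped before any analysis. The following is a minimal sketch of how the first row of the gene sequence could be cleaned, translated, and compared with the first row of the protein sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is built by hand on the assumption that translation table 11 assigns amino acids to codons in the same way as the standard genetic code, differing only in its permitted start codons.

```python
from itertools import product

# Codon table in TCAG order. The amino-acid assignments are those of the
# standard code, which translation table 11 is assumed to share.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First row of the gene sequence and of the protein sequence in this record.
gene_row = "ATGAGCGACC TGAACGAGTC AACGGTAGAG CTGGCGGTGC TTCAGTATCT GCGCGAACTC"
protein_row = "MSDLNESTVE LAVLQYLREL"

dna = clean_sequence(gene_row)
peptide = translate(dna)
matches = peptide == clean_sequence(protein_row)
print(len(dna), peptide, matches)
```

Applied to the full gene sequence, the same functions would give the overall GC fraction and a translation ending in `*` for the terminal TGA codon. A maintained library such as Biopython would be the more robust choice for real analyses.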