Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0986 |
Symbol | |
ID | 4446163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 1063872 |
End bp | 1066994 |
Gene Length | 3123 bp |
Protein Length | 1040 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639688792 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_830483 |
Protein GI | 116669550 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACC TGAACGAGTC AACGGTAGAG CTGGCGGTGC TTCAGTATCT GCGCGAACTC GGCTACACGA CGGCCTTCGG CCCAGACATC GCGCCCGAAG CGGCTGCGGC TGAGCGTTCG TCTTACGAGC AGGTATACCT GTTGAGCCGC TTGCGCGCGG CGGCGGTTCA GATCAACCGT GGCCTTGATG CGGCTCTCAT TGATGAGGCG ATCAAGCGCC TCGGCCGGGC GGAATCACAG AACCCAGTTG CGGAGAACCT TCGCGTCCAC GAACTCCTGA CCGAGGGCGT TCCGGTCGAA CACCGCGACG CCGAAGGCTC TGTACGGACG ACTCGTGTCC GCCTCATCGA CTTCGAGGAC CCGGCAGGCA ACGACTGGCT CGCCGTCAAC CAGTTCACAA TCATCGAGAA CGGCAAGAAC CGACGTCCGG ACGTCGTGCT CTTCTTGAAC GGCATGCCGC TTGGCCTGCT GGAGCTGAAG AACCTGGCCG ACGAGCACGC GACGCTCAAG GGAGCCTGGA ACCAAATCCA GACCTACCGC CATGACATTC CATCCCTGTT CACTCCGAAC GCGGTGACGG TCGTCAGCGA TGGCGTCAGC GCCGCCATGT CGTCATTCAC GGGCGGCTTC GAGCACTATG CGCCGTGGAA AACGATCGAC GGACGCGAGG TCGTGATGAA TCTGCCCGCA GTCGAGGTGT TGATCAAGGG CGTCTTCGAC CAGAAGCGCT TCCTCGACAT TCTGCAGAAC TTCATCGTCT TCAGCGACGA GTCCAAGGGG CTCGTGAAGC GCGTCGCGAA ATACCACCAG TACTGGGCCG TCAATGCCGC GGTCGAGTCG ACCATCGAGG CAGCAGGTCC CGATGGCGAC CGCCGCGGCG GCGTCGTGTG GCACACCCAA GGGAGTGGCA AGTCGATCGA GATGTTGCTC TACGCGGCGA AGATCATGCG CGACATTCGG ATGGGCAACC CGACGTTGTT GTTCATCACC GACCGCAACG ACCTCGACGA CCAGCTCTTC GGCGAGGTGT TTGCACCAGC CGAGATCCTG CCTGAGAAGC CCGTCCAAGC TGACTCCAGA GCAGACCTTC GCAGCCTGCT CCGCCGCGCG TCCGGCGGCA TCATCTTCAC CACCGTGCAG AAGTTCGCCC CCGAGGCTGG TGGCGACACC AACCCGGTAC TGACCGACCG CCGCAACGTC GTCGTTGTCG CCGACGAGGC TCACCGATCC CAATACGGCT TCACCGAGTC CCTTGACGAG CGCACCGGGC AGTTGAAGTC TGGGCTCGCG AAGCACATGC GCGACGCCCT CCCGAACGCC ACCTATCTCG GGTTCACGGG CACACCCATC GAGTCGAACG ACAAGTCAAC TCGCTCCGTG TTCGGTGACT ACATCGACAT CTATGACCTC ACGCGCGCTG TTGAGGACGG TGCCACCGTC CGGATTTTCT ACGAGTCCCG ACTCGCGAAG GTGTCCCTCG ATGCTGATGT GCACGCTGCA ATCGACGAAC TCGCCGACGA AATAACCGAG ACCGCAGAAG AGGACGAGGC CACCCGCGCC AAGTCCAGGT GGGCACGGTT GGAAGCCGTC GTGGGCGCGA ATGACCGTCT CGATGTGATT GCAGGCGACA TCGTCGACCA CTGGGAGAAA CGCCGGACCG AGATGTTCGG CAAGGCCATG ATTGTCACGA TGTCGCGGCG CATCGCCGTC GACCTCTACG ACAAGATCGT TAAGCTCAAG CCGGAGTGGC ACACCGACGA CCCGACGACA GGCATGATCA AGGTCGTCAT GACCGGTTCG GCGGCCGACC CTCAGGCCTT CCAGCCGCAC ATCTACGACA AGAAGACCCG CAAAGACCTT AAACTGCGGG CAAAGGACCC AAACGATTCC CTCGAGATCG TCATCGTCCG CGACATGTGG CTTACCGGCT TCGACGCCCC GTCGATGCAC ACTATGTACG TCGACAAACC AATGCAGGGC GCCGGCCTGA TGCAAGCCAT CGCACGGGTG AACCGTACCT TCCGCGACAA GCCCGGCGGC CTGATCGTCG ACTACATCGG CGTCGCCACA AATCTGCGCA GGGCCCTAGC CGAGTACTCC CCCAGCGATC GTGACCAGGC CGGCGTGCCG ATTGAGGAGA TCGTCTCCGC CATGTTGGAA AAGCACGACA TCGTGCGAGG ACTCCTTCAC GGTTGCAGGT ACAATTCCTC GCCGCTACTG GCGCCCGCCG CACGCCTAGC CCAGCACGCG CTCGTTCTCG ACTTCGTTAT GGCCGACCCG GACCGCACCG CACGTTACCT CGACCAAGTG CTCGCGCTAG CCAAGGCCTT TGCGCTCTGC GGGGCGCGGG ATGAAGCAGC TGCGATCCGT AATGACGTGC GGATGTTCGC CGATGTCCGA GCGGCGACCC TGAAGATCCA GAATCCGGAC TCAGGGCGTG CCGGCAGCGG TGCCGTAGAA ATAGACACCG CGATCGGGCA ACTCGTCAAC GAAGCCGTCA CCGCCGACGA GGTCGTTGAT ATCTACAAGC TCGCCGGCAT TGAAACTCCA GAGCTGTCGA TCCTGTCGGA CGAGTTCCTC GACACCCTGG CCGGGAAGGA GAAGCCCAAC CTCCAGATGG GGCTCCTCCG CCGGCTGATC AACGATCAAA TCCGCACCGT CCAGCGCACC AATATCGTTC AGGCACGAAA ATTCTCCGAG CAGCTCGACG AGGCAATTAA CCGCTATACG AACCGCACAC TGACGACAGC AGAAATCATT GCCGAGCTCG TCAAGCTCGC CAAAGACATG CGAAACCAGA ACGACCGTCA CAACAGACTC GGTCTTTCTG TCGCCGAGGC TGCATTTTAC GATGCCATCG TGCAAAACGA CGTCGCTGTC CTCCAGATGG GCGACGACAC GCTAAAGAAG ATTGCCGTCA ATCTCGTTTC CACCGTCCAG CGGAGCGCCA CAATCGACTG GTCCCTCAAA CATTCGGTCC GGGCCGCCAT GAGATCCAAA ATCCGTCGGC TACTTGCAAG GTACGACTAC CCGCCCGATC ACGAGGAGAA GGCGATTGAG CTGATACTCC GACAAGCTGA GCTAATCGCC GGGACCGAAG CGCAGTCAAC GGTACGCACT TGA
|
Protein sequence | MSDLNESTVE LAVLQYLREL GYTTAFGPDI APEAAAAERS SYEQVYLLSR LRAAAVQINR GLDAALIDEA IKRLGRAESQ NPVAENLRVH ELLTEGVPVE HRDAEGSVRT TRVRLIDFED PAGNDWLAVN QFTIIENGKN RRPDVVLFLN GMPLGLLELK NLADEHATLK GAWNQIQTYR HDIPSLFTPN AVTVVSDGVS AAMSSFTGGF EHYAPWKTID GREVVMNLPA VEVLIKGVFD QKRFLDILQN FIVFSDESKG LVKRVAKYHQ YWAVNAAVES TIEAAGPDGD RRGGVVWHTQ GSGKSIEMLL YAAKIMRDIR MGNPTLLFIT DRNDLDDQLF GEVFAPAEIL PEKPVQADSR ADLRSLLRRA SGGIIFTTVQ KFAPEAGGDT NPVLTDRRNV VVVADEAHRS QYGFTESLDE RTGQLKSGLA KHMRDALPNA TYLGFTGTPI ESNDKSTRSV FGDYIDIYDL TRAVEDGATV RIFYESRLAK VSLDADVHAA IDELADEITE TAEEDEATRA KSRWARLEAV VGANDRLDVI AGDIVDHWEK RRTEMFGKAM IVTMSRRIAV DLYDKIVKLK PEWHTDDPTT GMIKVVMTGS AADPQAFQPH IYDKKTRKDL KLRAKDPNDS LEIVIVRDMW LTGFDAPSMH TMYVDKPMQG AGLMQAIARV NRTFRDKPGG LIVDYIGVAT NLRRALAEYS PSDRDQAGVP IEEIVSAMLE KHDIVRGLLH GCRYNSSPLL APAARLAQHA LVLDFVMADP DRTARYLDQV LALAKAFALC GARDEAAAIR NDVRMFADVR AATLKIQNPD SGRAGSGAVE IDTAIGQLVN EAVTADEVVD IYKLAGIETP ELSILSDEFL DTLAGKEKPN LQMGLLRRLI NDQIRTVQRT NIVQARKFSE QLDEAINRYT NRTLTTAEII AELVKLAKDM RNQNDRHNRL GLSVAEAAFY DAIVQNDVAV LQMGDDTLKK IAVNLVSTVQ RSATIDWSLK HSVRAAMRSK IRRLLARYDY PPDHEEKAIE LILRQAELIA GTEAQSTVRT
|
| |