Gene Information |
Locus tag | Arth_4323 |
Symbol | |
ID | 4443504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008538 |
Strand | - |
Start bp | 57819 |
End bp | 62660 |
Gene Length | 4842 bp |
Protein Length | 1613 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639687644 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_829341 |
Protein GI | 116662287 |
COG category | [R] General function prediction only |
COG ID | [COG4889] Predicted helicase |
TIGRFAM ID | |
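The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon that contributes no residue. Neither convention is stated in the record. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 57819
end_bp = 62660
gene_length_bp = 4842
protein_length_aa = 1613

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)           # coordinates agree with Gene Length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codon count agrees with Protein Length
```

Under these assumptions, all three checks print True for this record.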
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTCTA ACACTTTCGA GCAGCTGCTC GACCGTCTCT ATTTCTCCGC CAAAAATGAG CGGGACAAGG GCACGAAATT CGAGCGGTTG TTCAAGCGGT ATCTGCAGTT GGAACCCAAG TACTCGGATC AGTTCTCCGA TGTGTGGCTA TGGGATGAGT GGCCGGACCG CAGAGGGCAG GTGGATACGG GGATCGATCT GGTCGCGAAG GACCGCTACA CCGGGGAACT GACTGCGATC CAGTGTAAGT TCTATGATCC GCAACGGACA CTGGATAAGA AGCACATCGA TTCTTTTTTC ACGGCCGCCG GCAAGGTCGA CTTCTCCTAC GGTCTGGTCG TTTCCACCAC GGACAAGTGG TCCAAGCACG CCGAAACCGC ACTGGAGGGC CAGTCGAAGC CGATGACCAG GTTGCGGCTG CAGGATCTGG CGGATTCGAC CATTGATTGG GCTGAGTTTG ATCTGGACCG GCCCGAGGAA ATGCGGCAGA TCGATCGTAA GGAACCGCGC AAGTATCAGC GCGATGCGAT CGACGATGTG ATCACAGGAT TTCAGACCTC TGATCGCGGG AAGCTGATCA TGGCTTGCGG CACGGGCAAG ACCTACACCT CGTTGAAGAT CGTCGAGGAA ATGGTTCCTG TCGGCGGTAC CGCCCTGTTC CTGGTGCCCT CCATAGCTCT GCTGCAGCAG ACGCTGAACG AGTGGACGGC CCAGGCCACC GTTCCGTTGC GGCCGCTGGC CGTGTGTTCG GACACGAAGG TCGGCCGCCG CGAACACGAG GACGTGTCCG TACACGATCT GGCGTTCCCT GCCACCACGG ACCCGCAGAA ACTCTTCTAC CGGACGAGCA TCAGCACCGG CCAGGAAGCG GTCACAGTGG TGTTCTCTAC CTACCAGTCC ATCGACGTCA TCGCCCAGGC CCAGGCCCTC GGACTGCCGG ACTTTGATGT GATCCTTTGC GATGAGGCAC ACCGGACGAC GGGGATTACG GAGGCGGAGC ATGATGACTC GGCCTTCGTG CGTGTCCACG ACCAGGCCTA CCTGCGCGCC AAGAAACGTC TGTATATGAC GGCGACGCCG CGGATCTATG TGCAGGATTC GAAGGCGAAG GCGGCAGAGA ACGATGTGGC CGTTTACTCC ATGGATGATG TCGCAGTCTA CGGGCCGGAA TTCCACCACC TGGGCTTCGG TAAAGCCGTA GAGATGGGTC ACCTCTCCGA TTACAAGGTC CTGGTCCTGG CGGTAAATGA GGAAGCGGTG TCCCGATCCT TCCAGGGCTT GTTCCAAGAA AATGGAGACC TGTCCCTGGA TGACGCCGCC CGAATCGTGG GCTGCTGGAA CGGCCTGTCC AAGCGCGGCG TCAACGGCGA GCGTCTCTCC ATCGGAGATA CGTCCCCTAT GAACAGGGCG GTCGCGTTCG CCCGGAACAT CAAGGAATCC AAGAAACTGG CCGAACAGTT CGAACTCATC GGCCGGCAAC TGCTGGTCGA AGATGACGAC GCGTTGAAGC TGGAAGCCGA ACATGTCGAC GGCACGTTCA ACGTCCTCGA ACGCTCCGCG AAACTGGACT GGCTGCAGGA CGAAACCAAG GGCAACGTGT GCCGGATCCT GTCCAACGCC AAATGCCTCA CCGAAGGCGT GGACGTCCCG TCCTTGGACG CGGTGCTATT CCTGAACCCC CGCAACTCCC AGGTCGACGT GGTGCAGGCC GTGGGCCGTG TCATGCGCCG CTCCGAAGGG AAGGAATACG GGTACATCAT CCTGCCGATC GCGGTTCCGG CCTCCGAAGA CCCGGAAACC 
GCGCTGAACG ACAACAAGAA GTACAAGGTC GTCTGGGATG TCCTCCAGGC ACTGCGGGCC CATGATGACC GGTTCGAAGC GATGATCAAC AAGCTGGACC TGAACGGCAA CACGAACGAC AAAATCGATA TCATCACCGT TGCCGACCCG TTCGGTCCAG GAGACGGCCC GGGTGAGATG CCCGGTTCCT CGGATAGGCC GGGACCCGAG GCGTTGTTCC ATATGGCCAA TGCGGACGAG TGGCGTAACG CGATTTTCGC CCGTATGGTC CGCAAGGTCG GGGACCGCCG CTACTGGGAA CAATGGGCCG AAGACGTCAA GGGCATTGCT GACCGGCACA TCATCCGGAT CCGTACGATA CTCGATGGCC CTGATGCCCG GGTGCGGGAC GAATTCGCTG GCTTCCTAGA GGGCCTGCGG GGGAACCTGA ACGCCTCCAT CAGCGAGTCC GACGCGATCG ACATGCTCTC CCAGCACTTG ATCACCAAGC CCGTGTTCGA GGCGTTGTTC GAGGACTATT CTTTCGCCGC TCACAACCCG GTGTCCCAGG TCATGGACTC CATGGTGTTG GTACTCGAGC AGTACAACCT TGACTCGGAG GTCCAGAACC TCGAAGACTT TTACCGATCC GTGCGGGTGA AAGCCGAGGG TGTGGGCACG GCCGCCGGCA AGCAGAAGAT CATCACTGAG CTCTACGAAA AGTTCTTCAA GCTCGCTTTC CCCCGCACCG CCGAATCACT GGGAATCGTT TACACGCCGG TTGAAGTCGT CGATTTCATC CTGCGCGCCG TCGACGACGT ATTGAAGAAA GAGTTCGGGG TTTCAATCTC CGATGAAGGC GTCCACGTAC TCGACCCGTT CACCGGGACC GGGACGTTCG TCGTGCGCCT GTTGCAGTCA GGACTGATCA AACCCGAAGA CCTACTGCGC AAGTACACCC AGGAGCTGCA CGCCAACGAG CTGCTGCTCA TGGCCTACTA CATCGCGGCG ATCAACATCG AGGCCACCTT CCACGGAATC CTCACTGAAC AAGCCGTCGA ACAGGGCCGT GACGCTGACA CGGTCGGCTA TGAATCGTTT GGGGGGATCG TGCTCACCGA CACCTTCCAA ATGACGGAAG ACGGAGACAC CCTTGACGAA CACGTCTTCA CCAACAATAA CGATCGCGTG GTCAAGCAGA ACGCTCTCGA TATCCGGGTG ATCATCGGCA ACCCGCCTTA CTCCGTCGGC CAATCCAGCG GCAACGACAA CAACGCCAAC CTCAAATACC CGACCCTGGA CGAGTCCATC CGCAGAAGTT ATGTGGCACA GTCCACGGCA ACGAATGTGA ACTCCCTCTA TGACTCATAT ATCCGCGCCA TCCGCTGGGC TTCCAATCGC GTACTGAACT CCGAACACGG CGGAGTGGTT TGTTACGTCT CCAACGGGGG ATACATCGAC GGCAACACCG CCGACGGACT CCGCAAGACT CTCACGACTG AGTTTCACGA GATCTACGTT TACAACCTGC GTGGGAACGC GCGGGGTGCC GGGGAACAAC GGCGGAAGGA AAAGGACAAC GTCTTTGGCG AAGGCAGCAA GACGACGGTT GCCGTTCTGC TGCTCGTCAA GCGCCCCGGA GCTGTGGCCG GATGCCGGCT CAATTACCGG GACATCGGTG ACTACCTCGA CCGCAAGCAA AAACTTGCCA TCGTCGACGA AGCCACCCTG GCCACCATTC CCTGGGAACG ACTTACCCCT AACGTGGAGG GAGACTGGAT CAACCAACGA GACGATATCT TCGAAACTTT CACGCCCATC GGCTCCCGGG 
CCAAAGAAGG AATTCGAATC TTCAACATCT TCTCGCGGGG GCTCGAAACC GGGCGAGATG CGTGGGTTTA CAACAGCAGC CGGTCAGCCA TGGCCCAGAA CGTTGATCGA CATGTGTCCG CCTACGAGGC CGACCGGAGC ACCATCAAGC CCGGCCTGAA AACGGGCACG CTCGCGACGC GAGTGGGCCA ATTGACACCC AGACTGAACT CGAACGGAAT GGAGATCAGC TGGACTAGAA GCCTGCGCCA ATCACTCGCC AGGGATGAAG AAGTGGAATA CAGGGAATCT GCTATTCGGA CTGCGACTTA TCGACCGTTC AACAAGCAGG CCGTTTACTA CGAACGCAAG ATGAACCACG AATGGTCCCA GTTAGAATCG ATCTTCCCAC CTAGCGGCGA AAATAACTTT GGCTTCTACA TTGTTGGGAA CGGTTCGGCT GTGCCGTTTG GCGTGTTGAT GACCGATCTT GTGCCGGACC TCCACGTCAC TGGGGCCGGA AGCGGAGGCC AGTTCTTCCC CCGCTACTCG TACACCAAAC CAGTTCATGG CGACGACTTG TTATCGGAAC TTACGGCCTC GCCGCTAGAC GCTGAGGACG GGCGCACCGA CAACGTGACA GACGCGGCCC TGGCCGATTA TCGGACGTTT TACGGATCGT GCGTGAGCAA AGACGACATC TTCTACTACG TCTATGGCAT CCTCCACTCG CCCGACTACC GTGAACGCTT CGCCGCGGAC CTCAAGAGAA TGCTCCCCCG CATTCCGAAA ATAGCAGGCA ACGACTTTCA TGCGTTCGCT GATGCCGGCC GGCAGTTGGC AGCCCTGCAC ATCGGTTACG AGGACCTAGA CCCGTTCCCG CTGAATGAAC GTCATACCGG TTTGGTGCTA GATGCCGACG ACTACACCAG GTACGCAGTG ATCAAGATGA AGTACGAAGG TAAGGCCGGA TCGTGGGATA AGACCCGCAT AATTTACAAC GGAAATATCA CCCTTGAAGG CATTCCCGTC GAGGTCCACG AGTACATGCT CGGTTCGCGC TCCGCCTTGG ATTGGATCCT AGAGAGGTAC CGGGTCAAGA CCGACAAGGA CTCCGGTATT GTCAATGACC CCAATGATTG GTCCCGCGAT CATCAGGAGC CGCGCTACAT CATCGACCTA ATTGCCAAGA TCGTGACATT GAGCCTTGAA TCCAACCGGA TCATCGGAGC CCTGCCCGAG CTCGCCGTGT GA
|
Protein sequence | MGSNTFEQLL DRLYFSAKNE RDKGTKFERL FKRYLQLEPK YSDQFSDVWL WDEWPDRRGQ VDTGIDLVAK DRYTGELTAI QCKFYDPQRT LDKKHIDSFF TAAGKVDFSY GLVVSTTDKW SKHAETALEG QSKPMTRLRL QDLADSTIDW AEFDLDRPEE MRQIDRKEPR KYQRDAIDDV ITGFQTSDRG KLIMACGTGK TYTSLKIVEE MVPVGGTALF LVPSIALLQQ TLNEWTAQAT VPLRPLAVCS DTKVGRREHE DVSVHDLAFP ATTDPQKLFY RTSISTGQEA VTVVFSTYQS IDVIAQAQAL GLPDFDVILC DEAHRTTGIT EAEHDDSAFV RVHDQAYLRA KKRLYMTATP RIYVQDSKAK AAENDVAVYS MDDVAVYGPE FHHLGFGKAV EMGHLSDYKV LVLAVNEEAV SRSFQGLFQE NGDLSLDDAA RIVGCWNGLS KRGVNGERLS IGDTSPMNRA VAFARNIKES KKLAEQFELI GRQLLVEDDD ALKLEAEHVD GTFNVLERSA KLDWLQDETK GNVCRILSNA KCLTEGVDVP SLDAVLFLNP RNSQVDVVQA VGRVMRRSEG KEYGYIILPI AVPASEDPET ALNDNKKYKV VWDVLQALRA HDDRFEAMIN KLDLNGNTND KIDIITVADP FGPGDGPGEM PGSSDRPGPE ALFHMANADE WRNAIFARMV RKVGDRRYWE QWAEDVKGIA DRHIIRIRTI LDGPDARVRD EFAGFLEGLR GNLNASISES DAIDMLSQHL ITKPVFEALF EDYSFAAHNP VSQVMDSMVL VLEQYNLDSE VQNLEDFYRS VRVKAEGVGT AAGKQKIITE LYEKFFKLAF PRTAESLGIV YTPVEVVDFI LRAVDDVLKK EFGVSISDEG VHVLDPFTGT GTFVVRLLQS GLIKPEDLLR KYTQELHANE LLLMAYYIAA INIEATFHGI LTEQAVEQGR DADTVGYESF GGIVLTDTFQ MTEDGDTLDE HVFTNNNDRV VKQNALDIRV IIGNPPYSVG QSSGNDNNAN LKYPTLDESI RRSYVAQSTA TNVNSLYDSY IRAIRWASNR VLNSEHGGVV CYVSNGGYID GNTADGLRKT LTTEFHEIYV YNLRGNARGA GEQRRKEKDN VFGEGSKTTV AVLLLVKRPG AVAGCRLNYR DIGDYLDRKQ KLAIVDEATL ATIPWERLTP NVEGDWINQR DDIFETFTPI GSRAKEGIRI FNIFSRGLET GRDAWVYNSS RSAMAQNVDR HVSAYEADRS TIKPGLKTGT LATRVGQLTP RLNSNGMEIS WTRSLRQSLA RDEEVEYRES AIRTATYRPF NKQAVYYERK MNHEWSQLES IFPPSGENNF GFYIVGNGSA VPFGVLMTDL VPDLHVTGAG SGGQFFPRYS YTKPVHGDDL LSELTASPLD AEDGRTDNVT DAALADYRTF YGSCVSKDDI FYYVYGILHS PDYRERFAAD LKRMLPRIPK IAGNDFHAFA DAGRQLAALH IGYEDLDPFP LNERHTGLVL DADDYTRYAV IKMKYEGKAG SWDKTRIIYN GNITLEGIPV EVHEYMLGSR SALDWILERY RVKTDKDSGI VNDPNDWSRD HQEPRYIIDL IAKIVTLSLE SNRIIGALPE LAV
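The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed 5' to 3' on the coding strand, which is consistent with it starting with ATG even though the gene lies on the minus strand. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as starts, so a standard codon table is used here. The function name `translate` and the variable names are hypothetical and are not part of the record.

```python
from itertools import product

# Standard codon table in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGGTTCTA ACACTTTCGA GCAGCTGCTC"
print(translate(first_blocks))  # MGSNTFEQLL
```

The output matches the first ten residues of the protein sequence. This sketch does not handle alternative start codons such as GTG or TTG, which are read as methionine when they initiate translation. That is not an issue here because the gene starts with ATG.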