Gene Arth_4323 details

Gene Information

Locus tag: Arth_4323
Symbol:
ID: 4443504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008538
Strand:
Start bp: 57819
End bp: 62660
Gene Length: 4842 bp
Protein Length: 1613 aa
Translation table: 11
GC content: 58%
IMG OID: 639687644
Product: type III restriction enzyme, res subunit
Protein accession: YP_829341
Protein GI: 116662287
COG category: [R] General function prediction only
COG ID: [COG4889] Predicted helicase
TIGRFAM ID:
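
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to check them. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(57819, 62660)
print(length)                     # 4842
print(protein_length_aa(length))  # 1613
```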


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTTCTA ACACTTTCGA GCAGCTGCTC GACCGTCTCT ATTTCTCCGC CAAAAATGAG 
CGGGACAAGG GCACGAAATT CGAGCGGTTG TTCAAGCGGT ATCTGCAGTT GGAACCCAAG
TACTCGGATC AGTTCTCCGA TGTGTGGCTA TGGGATGAGT GGCCGGACCG CAGAGGGCAG
GTGGATACGG GGATCGATCT GGTCGCGAAG GACCGCTACA CCGGGGAACT GACTGCGATC
CAGTGTAAGT TCTATGATCC GCAACGGACA CTGGATAAGA AGCACATCGA TTCTTTTTTC
ACGGCCGCCG GCAAGGTCGA CTTCTCCTAC GGTCTGGTCG TTTCCACCAC GGACAAGTGG
TCCAAGCACG CCGAAACCGC ACTGGAGGGC CAGTCGAAGC CGATGACCAG GTTGCGGCTG
CAGGATCTGG CGGATTCGAC CATTGATTGG GCTGAGTTTG ATCTGGACCG GCCCGAGGAA
ATGCGGCAGA TCGATCGTAA GGAACCGCGC AAGTATCAGC GCGATGCGAT CGACGATGTG
ATCACAGGAT TTCAGACCTC TGATCGCGGG AAGCTGATCA TGGCTTGCGG CACGGGCAAG
ACCTACACCT CGTTGAAGAT CGTCGAGGAA ATGGTTCCTG TCGGCGGTAC CGCCCTGTTC
CTGGTGCCCT CCATAGCTCT GCTGCAGCAG ACGCTGAACG AGTGGACGGC CCAGGCCACC
GTTCCGTTGC GGCCGCTGGC CGTGTGTTCG GACACGAAGG TCGGCCGCCG CGAACACGAG
GACGTGTCCG TACACGATCT GGCGTTCCCT GCCACCACGG ACCCGCAGAA ACTCTTCTAC
CGGACGAGCA TCAGCACCGG CCAGGAAGCG GTCACAGTGG TGTTCTCTAC CTACCAGTCC
ATCGACGTCA TCGCCCAGGC CCAGGCCCTC GGACTGCCGG ACTTTGATGT GATCCTTTGC
GATGAGGCAC ACCGGACGAC GGGGATTACG GAGGCGGAGC ATGATGACTC GGCCTTCGTG
CGTGTCCACG ACCAGGCCTA CCTGCGCGCC AAGAAACGTC TGTATATGAC GGCGACGCCG
CGGATCTATG TGCAGGATTC GAAGGCGAAG GCGGCAGAGA ACGATGTGGC CGTTTACTCC
ATGGATGATG TCGCAGTCTA CGGGCCGGAA TTCCACCACC TGGGCTTCGG TAAAGCCGTA
GAGATGGGTC ACCTCTCCGA TTACAAGGTC CTGGTCCTGG CGGTAAATGA GGAAGCGGTG
TCCCGATCCT TCCAGGGCTT GTTCCAAGAA AATGGAGACC TGTCCCTGGA TGACGCCGCC
CGAATCGTGG GCTGCTGGAA CGGCCTGTCC AAGCGCGGCG TCAACGGCGA GCGTCTCTCC
ATCGGAGATA CGTCCCCTAT GAACAGGGCG GTCGCGTTCG CCCGGAACAT CAAGGAATCC
AAGAAACTGG CCGAACAGTT CGAACTCATC GGCCGGCAAC TGCTGGTCGA AGATGACGAC
GCGTTGAAGC TGGAAGCCGA ACATGTCGAC GGCACGTTCA ACGTCCTCGA ACGCTCCGCG
AAACTGGACT GGCTGCAGGA CGAAACCAAG GGCAACGTGT GCCGGATCCT GTCCAACGCC
AAATGCCTCA CCGAAGGCGT GGACGTCCCG TCCTTGGACG CGGTGCTATT CCTGAACCCC
CGCAACTCCC AGGTCGACGT GGTGCAGGCC GTGGGCCGTG TCATGCGCCG CTCCGAAGGG
AAGGAATACG GGTACATCAT CCTGCCGATC GCGGTTCCGG CCTCCGAAGA CCCGGAAACC
GCGCTGAACG ACAACAAGAA GTACAAGGTC GTCTGGGATG TCCTCCAGGC ACTGCGGGCC
CATGATGACC GGTTCGAAGC GATGATCAAC AAGCTGGACC TGAACGGCAA CACGAACGAC
AAAATCGATA TCATCACCGT TGCCGACCCG TTCGGTCCAG GAGACGGCCC GGGTGAGATG
CCCGGTTCCT CGGATAGGCC GGGACCCGAG GCGTTGTTCC ATATGGCCAA TGCGGACGAG
TGGCGTAACG CGATTTTCGC CCGTATGGTC CGCAAGGTCG GGGACCGCCG CTACTGGGAA
CAATGGGCCG AAGACGTCAA GGGCATTGCT GACCGGCACA TCATCCGGAT CCGTACGATA
CTCGATGGCC CTGATGCCCG GGTGCGGGAC GAATTCGCTG GCTTCCTAGA GGGCCTGCGG
GGGAACCTGA ACGCCTCCAT CAGCGAGTCC GACGCGATCG ACATGCTCTC CCAGCACTTG
ATCACCAAGC CCGTGTTCGA GGCGTTGTTC GAGGACTATT CTTTCGCCGC TCACAACCCG
GTGTCCCAGG TCATGGACTC CATGGTGTTG GTACTCGAGC AGTACAACCT TGACTCGGAG
GTCCAGAACC TCGAAGACTT TTACCGATCC GTGCGGGTGA AAGCCGAGGG TGTGGGCACG
GCCGCCGGCA AGCAGAAGAT CATCACTGAG CTCTACGAAA AGTTCTTCAA GCTCGCTTTC
CCCCGCACCG CCGAATCACT GGGAATCGTT TACACGCCGG TTGAAGTCGT CGATTTCATC
CTGCGCGCCG TCGACGACGT ATTGAAGAAA GAGTTCGGGG TTTCAATCTC CGATGAAGGC
GTCCACGTAC TCGACCCGTT CACCGGGACC GGGACGTTCG TCGTGCGCCT GTTGCAGTCA
GGACTGATCA AACCCGAAGA CCTACTGCGC AAGTACACCC AGGAGCTGCA CGCCAACGAG
CTGCTGCTCA TGGCCTACTA CATCGCGGCG ATCAACATCG AGGCCACCTT CCACGGAATC
CTCACTGAAC AAGCCGTCGA ACAGGGCCGT GACGCTGACA CGGTCGGCTA TGAATCGTTT
GGGGGGATCG TGCTCACCGA CACCTTCCAA ATGACGGAAG ACGGAGACAC CCTTGACGAA
CACGTCTTCA CCAACAATAA CGATCGCGTG GTCAAGCAGA ACGCTCTCGA TATCCGGGTG
ATCATCGGCA ACCCGCCTTA CTCCGTCGGC CAATCCAGCG GCAACGACAA CAACGCCAAC
CTCAAATACC CGACCCTGGA CGAGTCCATC CGCAGAAGTT ATGTGGCACA GTCCACGGCA
ACGAATGTGA ACTCCCTCTA TGACTCATAT ATCCGCGCCA TCCGCTGGGC TTCCAATCGC
GTACTGAACT CCGAACACGG CGGAGTGGTT TGTTACGTCT CCAACGGGGG ATACATCGAC
GGCAACACCG CCGACGGACT CCGCAAGACT CTCACGACTG AGTTTCACGA GATCTACGTT
TACAACCTGC GTGGGAACGC GCGGGGTGCC GGGGAACAAC GGCGGAAGGA AAAGGACAAC
GTCTTTGGCG AAGGCAGCAA GACGACGGTT GCCGTTCTGC TGCTCGTCAA GCGCCCCGGA
GCTGTGGCCG GATGCCGGCT CAATTACCGG GACATCGGTG ACTACCTCGA CCGCAAGCAA
AAACTTGCCA TCGTCGACGA AGCCACCCTG GCCACCATTC CCTGGGAACG ACTTACCCCT
AACGTGGAGG GAGACTGGAT CAACCAACGA GACGATATCT TCGAAACTTT CACGCCCATC
GGCTCCCGGG CCAAAGAAGG AATTCGAATC TTCAACATCT TCTCGCGGGG GCTCGAAACC
GGGCGAGATG CGTGGGTTTA CAACAGCAGC CGGTCAGCCA TGGCCCAGAA CGTTGATCGA
CATGTGTCCG CCTACGAGGC CGACCGGAGC ACCATCAAGC CCGGCCTGAA AACGGGCACG
CTCGCGACGC GAGTGGGCCA ATTGACACCC AGACTGAACT CGAACGGAAT GGAGATCAGC
TGGACTAGAA GCCTGCGCCA ATCACTCGCC AGGGATGAAG AAGTGGAATA CAGGGAATCT
GCTATTCGGA CTGCGACTTA TCGACCGTTC AACAAGCAGG CCGTTTACTA CGAACGCAAG
ATGAACCACG AATGGTCCCA GTTAGAATCG ATCTTCCCAC CTAGCGGCGA AAATAACTTT
GGCTTCTACA TTGTTGGGAA CGGTTCGGCT GTGCCGTTTG GCGTGTTGAT GACCGATCTT
GTGCCGGACC TCCACGTCAC TGGGGCCGGA AGCGGAGGCC AGTTCTTCCC CCGCTACTCG
TACACCAAAC CAGTTCATGG CGACGACTTG TTATCGGAAC TTACGGCCTC GCCGCTAGAC
GCTGAGGACG GGCGCACCGA CAACGTGACA GACGCGGCCC TGGCCGATTA TCGGACGTTT
TACGGATCGT GCGTGAGCAA AGACGACATC TTCTACTACG TCTATGGCAT CCTCCACTCG
CCCGACTACC GTGAACGCTT CGCCGCGGAC CTCAAGAGAA TGCTCCCCCG CATTCCGAAA
ATAGCAGGCA ACGACTTTCA TGCGTTCGCT GATGCCGGCC GGCAGTTGGC AGCCCTGCAC
ATCGGTTACG AGGACCTAGA CCCGTTCCCG CTGAATGAAC GTCATACCGG TTTGGTGCTA
GATGCCGACG ACTACACCAG GTACGCAGTG ATCAAGATGA AGTACGAAGG TAAGGCCGGA
TCGTGGGATA AGACCCGCAT AATTTACAAC GGAAATATCA CCCTTGAAGG CATTCCCGTC
GAGGTCCACG AGTACATGCT CGGTTCGCGC TCCGCCTTGG ATTGGATCCT AGAGAGGTAC
CGGGTCAAGA CCGACAAGGA CTCCGGTATT GTCAATGACC CCAATGATTG GTCCCGCGAT
CATCAGGAGC CGCGCTACAT CATCGACCTA ATTGCCAAGA TCGTGACATT GAGCCTTGAA
TCCAACCGGA TCATCGGAGC CCTGCCCGAG CTCGCCGTGT GA
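
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of that step and of a GC-fraction calculation such as the one behind the "GC content" field. The helper names are hypothetical, and only the first printed line of the sequence is used for the demonstration.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all characters of seq."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence, used only as a short demonstration.
first_line = "ATGGGTTCTA ACACTTTCGA GCAGCTGCTC GACCGTCTCT ATTTCTCCGC CAAAAATGAG "
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC share of this 60-base excerpt only
```

Pasting the full sequence into `clean_sequence` in place of `first_line` gives the whole-gene figure.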
 
Protein sequence
MGSNTFEQLL DRLYFSAKNE RDKGTKFERL FKRYLQLEPK YSDQFSDVWL WDEWPDRRGQ 
VDTGIDLVAK DRYTGELTAI QCKFYDPQRT LDKKHIDSFF TAAGKVDFSY GLVVSTTDKW
SKHAETALEG QSKPMTRLRL QDLADSTIDW AEFDLDRPEE MRQIDRKEPR KYQRDAIDDV
ITGFQTSDRG KLIMACGTGK TYTSLKIVEE MVPVGGTALF LVPSIALLQQ TLNEWTAQAT
VPLRPLAVCS DTKVGRREHE DVSVHDLAFP ATTDPQKLFY RTSISTGQEA VTVVFSTYQS
IDVIAQAQAL GLPDFDVILC DEAHRTTGIT EAEHDDSAFV RVHDQAYLRA KKRLYMTATP
RIYVQDSKAK AAENDVAVYS MDDVAVYGPE FHHLGFGKAV EMGHLSDYKV LVLAVNEEAV
SRSFQGLFQE NGDLSLDDAA RIVGCWNGLS KRGVNGERLS IGDTSPMNRA VAFARNIKES
KKLAEQFELI GRQLLVEDDD ALKLEAEHVD GTFNVLERSA KLDWLQDETK GNVCRILSNA
KCLTEGVDVP SLDAVLFLNP RNSQVDVVQA VGRVMRRSEG KEYGYIILPI AVPASEDPET
ALNDNKKYKV VWDVLQALRA HDDRFEAMIN KLDLNGNTND KIDIITVADP FGPGDGPGEM
PGSSDRPGPE ALFHMANADE WRNAIFARMV RKVGDRRYWE QWAEDVKGIA DRHIIRIRTI
LDGPDARVRD EFAGFLEGLR GNLNASISES DAIDMLSQHL ITKPVFEALF EDYSFAAHNP
VSQVMDSMVL VLEQYNLDSE VQNLEDFYRS VRVKAEGVGT AAGKQKIITE LYEKFFKLAF
PRTAESLGIV YTPVEVVDFI LRAVDDVLKK EFGVSISDEG VHVLDPFTGT GTFVVRLLQS
GLIKPEDLLR KYTQELHANE LLLMAYYIAA INIEATFHGI LTEQAVEQGR DADTVGYESF
GGIVLTDTFQ MTEDGDTLDE HVFTNNNDRV VKQNALDIRV IIGNPPYSVG QSSGNDNNAN
LKYPTLDESI RRSYVAQSTA TNVNSLYDSY IRAIRWASNR VLNSEHGGVV CYVSNGGYID
GNTADGLRKT LTTEFHEIYV YNLRGNARGA GEQRRKEKDN VFGEGSKTTV AVLLLVKRPG
AVAGCRLNYR DIGDYLDRKQ KLAIVDEATL ATIPWERLTP NVEGDWINQR DDIFETFTPI
GSRAKEGIRI FNIFSRGLET GRDAWVYNSS RSAMAQNVDR HVSAYEADRS TIKPGLKTGT
LATRVGQLTP RLNSNGMEIS WTRSLRQSLA RDEEVEYRES AIRTATYRPF NKQAVYYERK
MNHEWSQLES IFPPSGENNF GFYIVGNGSA VPFGVLMTDL VPDLHVTGAG SGGQFFPRYS
YTKPVHGDDL LSELTASPLD AEDGRTDNVT DAALADYRTF YGSCVSKDDI FYYVYGILHS
PDYRERFAAD LKRMLPRIPK IAGNDFHAFA DAGRQLAALH IGYEDLDPFP LNERHTGLVL
DADDYTRYAV IKMKYEGKAG SWDKTRIIYN GNITLEGIPV EVHEYMLGSR SALDWILERY
RVKTDKDSGI VNDPNDWSRD HQEPRYIIDL IAKIVTLSLE SNRIIGALPE LAV
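
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional T, C, A, G ordering; translation table 11 uses the same amino acid assignments as the standard code and differs only in which codons may initiate translation. The function name is illustrative, and the demonstration translates only the first 30 bases of the gene.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three printed blocks of the gene sequence.
print(translate("ATGGGTTCTA ACACTTTCGA GCAGCTGCTC"))  # MGSNTFEQLL
```

The result matches the first ten residues of the protein sequence listed above.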