Gene Arth_4173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4173 
Symbol 
ID4443634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008539 
Strand
Start bp10 
End bp4737 
Gene Length4728 bp 
Protein Length1575 aa 
Translation table11 
GC content65% 
IMG OID639687698 
Producthelicase domain-containing protein 
Protein accessionYP_829395 
Protein GI116662342 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG4646] DNA methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGTCCA CCGTCCAGGA GCTCGGCTTC ACCGGGGGCG CTGTTCTGGA ACCGGGATCC 
GGTGTTGGCA CCTTCATGGG CTTCGCCCCG GATGCCGCCG AGATGACCGG TGTGGAACTG
GATCCAACCA CCGCGGCCAT CTCCCAGGCC CTTTACCCCG AGGCCACCGT CCGGGCCGAG
TCCTTTGCCG ATACCAAGCT GCCGACCGGG CACTTCGACC TGGCTATCGG CAACGTACCC
TTCGCGAACG TCACCCTGCA CGATCCCCGC CACAACGCCG GCGGCCATTC AATCCATAAC
CACTTCATAC TCAAATCCCT GGAACTGACC CGCCCGGGCG GGATGGTCGC CGTGCTGACG
TCCAGCTTCA CCCTGGACGG CACAAATCCC TCAGCCCGCC GGGAAATGAA CCAGCTGGCC
GACCTCGTGG GCGCCGTCCG GCTACCCACC GGCTCACACC GCAAGGCTGC AGGCACCGAT
GCCATGACGG ACCTGCTGAT CTTCCGCCGC CGGGAACCCG GGCAGGAACC GGCATCCACC
CTCTGGGAAA CCGTCACGGC CCGCCAGATC GACGGAACCA TCACCCGCCT GAACTCCTAC
TTCGACGAAT ACCCCGCACG GCTCCTTGGC GAACTGCACG TCAGCCACGG CATGTACGGG
GCCGAAACCC TGCAGCTCAC CACCGATGAG CTCTCCGCCG TCCCGGAACG GCTGAACGCG
GCCCTGGCCG ACGTCGTACG CGAGGCCCGG ACCGCCGGAA TGATGATGAC CGAGCGGACC
GCTGACCAGG AGCGCCAGCG GGCCGCCTAT GTTCCGGCGG CCGCCCATGA GTGGGACGGC
CACATCAGCG CCGCTGACAG CGGCTTTACC GTTGTCGAAA AAGGCTCACA CACCGAGTTG
GCAGTACCGA AGACCCAGGC CGCTGAACTT AGGGCCCTGC TGGGGCTGCG CGACAGCGCC
CGGCTGCTGC TCACCGCAGA GGCCGAAAGC CGCGAGGACA CCGCCGGCAT TGATGGGCTC
CGTGAGGAAC TGAAGAGCGC CTACGGCCGG TACGCGGAAA CCTACGGGCC GATCAACCGC
TACACGCTCC GGAATACAGG GCGCGTGGAT GAAGAGACGC AGGAGCCCAT CCAGGCCCGC
ATGACGCCCA AGGCAGTCTC CATCGTCTCC CGCGACCCGT TCGGTCCTCT GGTAATGGCG
TTGGAAAACT TCGATGAAGC GACGCAGACA GCTGCACCGG CCGCCCTGCT TTCGACCCGT
CAGGTCCAGC CGCGCCGACC GGTCCTTGGC GTGGACACGG CAGAAGAAGC CCTCACCGTC
ACCCTCGACA CCGTGGGCGA GGTTGACCTG GACTACGCAG CATCCCTGCT GGGCATCAGT
GCTGATGACG CCCGGGCCGC CATGGGGGAG AGCATCTACC AGATCCCCGG CACCGGGGAG
AGTTTCCAGA CCCGGGCCGA ATACCTTTCC GGGAACGTGC GGGAGAAGCT TGAAATTGCC
CAGGCAGCCG CGCTCTCCGA TGATCGGTTC GCCGTCAACG TCCAGGCCCT CACCGAAGCC
ATGCCGGAAC CGCTGCGCAT GGATGAGGTG GAAGCCCGCC TCGGTGCGGT CTGGATCGAC
GCAGACACCC ACCAGGAATT TGTCCGGGAA ATCCTCAACG ACCCCTACGC CACCGTCTCC
AACGCAGCCG GCTCCATGTG GGACGTCAAA GCCAACCGCC ACACCCTGGC CGCCACCAGC
AACTGGGGCA CCCAGCGGAT GCCCGCCTCC GACGTCCTCA AGCAGGTCCT GGAACAGCGC
CCGGTGCGTG TCACGGACGA AGGAGAGGAC AAGCGCCGCA TCCTGAACCC CACAGAAACC
GCAGCGGCCC AGGAAAAAGC CCAGATGCTG CAGGAGCGTT TCAGTGAATG GGTATGGGAA
GAACCCGAAC GGGCCACCCG TCTCATCGGC GAATACAACC GCCGCTTCAA TTCCATCGTG
CTGCGCGACT ACACCACCGA GGGGGAGCGG CTGACGCTGC CCGGGATGGC CAAGGACTTC
TCCCCCCGCC CGCACCAGCG CGCGGCCGTG GCCCGGATGC TCTCGGAGCC TGCCGTGGGC
CTGTTCCACC AGGTCGGGGC GGGAAAGACC GCCGAGATGG TCATGGGCGT GATGGAACTG
CGCCGTCTGG GTATGGTCAA CAAGCCCGCC GTCGTGATCC CCAACCACAT GCTCGAACAG
TTCGCCCGCG AATGGCTCCA GATCTACCCG CAGGCCCGGA TCCTCGCAGC ATCCTCGAAT
GACCTTGCCG GGGATAAGCG CCGGCAGTTC GTGGCCCGCG CGGCCGCCAA CGAGTGGGAC
GCCGTCGTCA TGACCCGCAC CGCGTTCCAG CGCGTGAGCC TCAGCCCCGA AGCGGAAGTC
TCGTACATGA GGGCCGAAGT CGCCCAGGCG CGGGCCGAGC TGGAAGCAGT CAGGGACAGC
GGCCAGGTCA ACGGACAGCC CAGCGCGAGC ATCGTCAAAC GGCTCGAAAA GGTGGTCCTG
GGACGGGAGG AAAGCCTCAA AGCCAAGCTC GATACCGCCG CCGATCCGGG AATCAGCTTC
GAGGAAACCG GCATTGACTA CCTCGTGGTG GACGAGCTGC ACGACTACAA GAACCTGGAG
ACGCCCAGCA ACATCCCCGG CGCCGGAATC CAAGGTTCCA ACCGTGCATC GGACCTGCAT
ATGAAGACAG AGTTCCTGCG CCAGCGCGAG GGCCGGCGGG TGATCACCGG AGCCACAGCG
ACACCCATCG CCAACTCCGT CACGGAAATG TACGTCATGC AGCGCTACCT GCGCCCGGAC
CTGCTCCAGG CTGCCGGAAT CCAGGACTTC AACACCTGGG CAGCAACCTT CGGGCAGGTC
GTCGAGGAAA TGGAACTCTC CGTCGCCGGC GGCGACCGGT TCAAACTCAA GAGCCGGTTC
GCGAAATTCC AGAACGTCCC CGAACTGTTG AAGATGTTCC ACACCTTCGC GGACGTGAAG
ACCGCCGAGG ACCTGAAGCT CCCCGTACCG GCCCTCGCCG CACGGGACGG GGACGGGCTG
CGGCAGCCGA ACATGCTCAC CGTGGAACCC AGCCCGGAAC TGCGGGAGTA CATCCAGGAC
ATCGGCCAGC GGGTGGACCG CATCCAGCAA AAGCTCGTGG ACGCCGAGGA AGACAACATG
CTCAAAGTCT CCTCGGACGG CCGGAAGGCC GCCCTGGACA TGCGACTCGT TGACCCCGCA
TTGTCCCAGC AGGGGTCCAC CAAGATCAGC GCCACCGCTG ACCTGCTGGC CAGCGTGTAC
GAAGAACACA AGGACCGCAT TTACACCGAC CCGAAAACCG GAGAACCGGA CCCCGTGCCC
GGAGCCCTGC AACTGGTGTT CTGCGACTTC GGCACACCCT CTGACCGGTG GAACGTCTAC
GGCGAACTCA AAGACCAGCT ACGGCGCCGC GGCGTGCCCG AACACATGGT CCGGTTCATC
CACGAAGCCA AGAATGACAC CGAGAAAGGC CGGCTCTTCG CCGCCGCCCG CTCCGGCCAG
ATCGCCGTGC TGATGGGATC GACCTCAAAA ATGGGTGTTG GCACCAACAT TCAAAAACGC
GCCGTGCACC TGGTCGATAT GGACGCCCCC TGGCGGCCCT CCGACGTCGA ACAACGCCAC
GGCCGCATTC TGCGCCAAGG CAACCAGAAC TCCGAGGTCC GCATCTCCCA GGTCGTCACC
AAGGAATCCT TCGATTCCTT CATGTGGCAG GGACTTGAAA GAAAGTCCCG GTTCATCAAC
CAGATCATGC GCGGACGCCT GGACGTGCGC GAAATCGAGG ACATCGGTGA CAACACCCTG
AACTTCGCCC AGGCCAAAGC CATCACCAGC GGCAACCCGC TGGTGATGGA AAAAGCCGTC
GCCGACCAGG AACTAGCCCG CCTGAGCCGC CTGGACCGGG CCTACAACCG CAATATGATT
GCTGTCGCAC ACACCAAACG CGGTGAACAA TCGGCAGCCG ACGCCGCCGC CCAGGACCTG
CCCCTGATCC AGGCAGCAGC AGCGCGGACC CTCGACACCA CCGCGGATGC CTTCAAAGCC
ACCATCGAAG GAACGTTCCT CGACAACCGC GGGGATGCCG CCGAAGCGGT CCGGGCATGG
GCCGGCAAAC ACGGCCACCG GCTCATGAAC CTCTACGGCT ACGACGAGCT GGGAACGATC
GCCACCCTTG GCGGCCATGA ACTCCGTGCC CGCCTCATGC CGGCCAGGGA CCTTGACCGG
GCAACCGTCG AAATACGCAT CGAGGGAGTG CCCAGGGCAA CCACACAAAT AGCCCGCCGC
AGCCTGCTGT CCGCCGATCT AGGCACCATC AGACAGCTCG AAAACAGAGT CTCATCCCTC
CCGAAGCTGG CAGCCGACGT CGAAGGACGA CGGCAAGATG CACTCACCCA CGTCGAACAG
GCAGACAAAG CCCTGGCGGA GCCTTTCAAA CACGCCGACG CCCTGAAAGC CGCCCTGACG
GACTCGGCAC GTATTAACCA ACTCATGGCC GAGGCAGCGA AACCGGAAGA ACCAGCGCAG
CCCGAAACCC CCGCCGACAT CGACTCCCGC ATGGCAAAGA TGCAACGGCT CATGAACGCT
GATTTCCCCG CACAACCAGG CACCGCAGCC CCACCGGCCG GCGGCACGAA AGCCCAACCC
GGGACAGTGA ACCAACAACG ACAACACCAG ACCGAGTACG AACGCTGA
 
Protein sequence
MWSTVQELGF TGGAVLEPGS GVGTFMGFAP DAAEMTGVEL DPTTAAISQA LYPEATVRAE 
SFADTKLPTG HFDLAIGNVP FANVTLHDPR HNAGGHSIHN HFILKSLELT RPGGMVAVLT
SSFTLDGTNP SARREMNQLA DLVGAVRLPT GSHRKAAGTD AMTDLLIFRR REPGQEPAST
LWETVTARQI DGTITRLNSY FDEYPARLLG ELHVSHGMYG AETLQLTTDE LSAVPERLNA
ALADVVREAR TAGMMMTERT ADQERQRAAY VPAAAHEWDG HISAADSGFT VVEKGSHTEL
AVPKTQAAEL RALLGLRDSA RLLLTAEAES REDTAGIDGL REELKSAYGR YAETYGPINR
YTLRNTGRVD EETQEPIQAR MTPKAVSIVS RDPFGPLVMA LENFDEATQT AAPAALLSTR
QVQPRRPVLG VDTAEEALTV TLDTVGEVDL DYAASLLGIS ADDARAAMGE SIYQIPGTGE
SFQTRAEYLS GNVREKLEIA QAAALSDDRF AVNVQALTEA MPEPLRMDEV EARLGAVWID
ADTHQEFVRE ILNDPYATVS NAAGSMWDVK ANRHTLAATS NWGTQRMPAS DVLKQVLEQR
PVRVTDEGED KRRILNPTET AAAQEKAQML QERFSEWVWE EPERATRLIG EYNRRFNSIV
LRDYTTEGER LTLPGMAKDF SPRPHQRAAV ARMLSEPAVG LFHQVGAGKT AEMVMGVMEL
RRLGMVNKPA VVIPNHMLEQ FAREWLQIYP QARILAASSN DLAGDKRRQF VARAAANEWD
AVVMTRTAFQ RVSLSPEAEV SYMRAEVAQA RAELEAVRDS GQVNGQPSAS IVKRLEKVVL
GREESLKAKL DTAADPGISF EETGIDYLVV DELHDYKNLE TPSNIPGAGI QGSNRASDLH
MKTEFLRQRE GRRVITGATA TPIANSVTEM YVMQRYLRPD LLQAAGIQDF NTWAATFGQV
VEEMELSVAG GDRFKLKSRF AKFQNVPELL KMFHTFADVK TAEDLKLPVP ALAARDGDGL
RQPNMLTVEP SPELREYIQD IGQRVDRIQQ KLVDAEEDNM LKVSSDGRKA ALDMRLVDPA
LSQQGSTKIS ATADLLASVY EEHKDRIYTD PKTGEPDPVP GALQLVFCDF GTPSDRWNVY
GELKDQLRRR GVPEHMVRFI HEAKNDTEKG RLFAAARSGQ IAVLMGSTSK MGVGTNIQKR
AVHLVDMDAP WRPSDVEQRH GRILRQGNQN SEVRISQVVT KESFDSFMWQ GLERKSRFIN
QIMRGRLDVR EIEDIGDNTL NFAQAKAITS GNPLVMEKAV ADQELARLSR LDRAYNRNMI
AVAHTKRGEQ SAADAAAQDL PLIQAAAART LDTTADAFKA TIEGTFLDNR GDAAEAVRAW
AGKHGHRLMN LYGYDELGTI ATLGGHELRA RLMPARDLDR ATVEIRIEGV PRATTQIARR
SLLSADLGTI RQLENRVSSL PKLAADVEGR RQDALTHVEQ ADKALAEPFK HADALKAALT
DSARINQLMA EAAKPEEPAQ PETPADIDSR MAKMQRLMNA DFPAQPGTAA PPAGGTKAQP
GTVNQQRQHQ TEYER