Gene Arth_3798 details


Gene Information

Locus tag: Arth_3798
Symbol:
ID: 4447724
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4282479
End bp: 4285943
Gene Length: 3465 bp
Protein Length: 1154 aa
Translation table: 11
GC content: 67%
IMG OID: 639691622
Product: SNF2-related protein
Protein accession: YP_833273
Protein GI: 116672340
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
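
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4282479
end_bp = 4285943

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 3465
print(protein_length)  # 1154
```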


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATCCC AACCGGACGA GAGTGCGGCA CTGGCAATAC AGACCCCCGC GATCAACGAC 
CGCTCCCTCG CGGCCGGACT CGCCTACGCC ATGGGCAGCC GGGTCTCGGG GATTTCCTTC
GATGCCGCCA CCGGCCTGAT GCTCGGCAAG GTCCGCGGCG GTTCGGACGC GCCCTATTCA
ACCACTGCCA AGCTGGTCCG GAAGGCGGGC GGCTGGAGCT GCACCGTGGG CGTCTGCAGC
TGCCCGGTGC GGAAGGACTG CAAGCATGTG GCCGCGCTGC TGTTCGCGGC CGAGGACAGT
CCCGCCATCC GCGTCCAGCT CCTGGCCCCG GCCGAGGCCA CCCGCCTGTC CCGTGAACCC
CTGGCGCAGG ACATGCCGGA CTGGGAGCAG GCCCTGGGCC CGCTCATTGC CCGCCCTGGA
ATCACCCCGT CCACCAACGG CATCCCGCTG GCCCTCCAGT TCGACATTGA AGAGCCGGCC
CCGCACTTTT CCTACACCGG CCGCCGTGAT CCGCTCCGCA GCGTCCGCCA GCTGAAGGCG
CGCCCGGTGA TCATGGGCGC CAAGGGCAAG TGGATCCGCG GCGACGTCTC CTGGAACACC
CTGAGCTACC TCAACTACCG GCGTGAATGC AACGAGGCCC ACGTGGAGTG GATGCAGGAA
TTCCTTGCCT CCCACACCGC CCTCGCGAAC CGGCAGCACA ACTCCACCGC GCCCTGGCTG
GGGCTGAACA CCTACGCCGG GAAAAACCTG TGGAGCCTGC TCGCCCAGGC CCGGAAGATC
GGCGTCGCGC TGGTGCATGC GGGCAGCCAG GAACCCGTGC GGTTTGTGGA GGAGCCGGCC
GCGGTGGGAC TTAACCTGAC CCGTTTCGGC GCGTCCGGCG GGGAAACGGG CGGCGGCGCC
GCCGTCGAAC GTGCCGCCGT CGGGCCGGCC GCCGCCGTCA ACGCGAAGAC GTCCGACGAC
GGCGGGCTGA CGCTGGCGCC CACCATGACC GTTGAAGGGG AAGTGGTTGA CCCGGCGTCC
GTGGGGACCA TCGGCCGGCC GGCCCACGGG ATCTTCCTGA CGTCCGGAGC GGACGCTCTT
CCCGGCACGG GCAACGGATC GATCATCACG CTGGCTCCGC TGGAAAGCGG GCTTAGCGAA
GAACTCCTGA CATTCGTCAC GGCCGGCAGC ACGCTGCACA TTCCGGCGCG GGACGAGACC
CGCTTCCTCA CCGGCTTCTA CCCCAAGCTC AAGCAGGCCG CCCGCGTCAC CGCGTCGGAC
CAGTCGGTGG AACTGCCCAC GCTGGCCGTC CCCACCTTGT CGCTCCTGGC CAACTACGGT
GCCGACCACA AAGTCCGGCT GCACTGGGAA TGGCACTACA CGTCAGGGAA CCTGGTCACG
GCGCAGCCCC TCTGGCGGCA TCCGGGCGAC CACGGCTACC GTGACGACCC CACGGAGGCC
CGCATCCTCG AGGCCGTGGG ACAGCCCTGG GAAGTTGTCC CCAAGCTGGG CGAATCCGCC
ACCGGAGGCT GGGGAACTCC GCGGCTTGCT GCTTCTGCCG AGCTGAGCGG GCTGGACACG
CTTGCCTTTA CCGAGGAAGT GCTTCCCCGG CTCCGCAACA CCCCGGACGT GGTGGTGGAC
ACCTCAGGTG AGATCGCCGA CTACCGGGAG GCCGAGGAAG CCCCCGTGGT GGCCATCTCC
ACCAAGGCCA CGGACCAGCG CGACTGGTTC GATCTCGGTA TCCAGATCTC CCTTGAAGGC
CAGCCCGTTT CCTTCGCAGC GGTGTTCTCC GCCCTCGCCG CCGGCCAGAC GAAGATGCTC
CTGCCCAGCG GCGCTTACTT TTCGCTGGAC CTTCCGGAAC TCCATCAGCT GCGGGCCCTG
ATCGAGGAGG CCCGGTCCCT TCAGGACAAC AAGGACGCGC CGCTGCAGAT CAGCCGGTTC
CAGGCCGGGC TTTGGGATGA ACTGGCGCAC CTTGGCGTTG TCGATGAGCA GGCGGCCGCC
TGGCGCGAGG CCGTGGGCGG ACTGCTGGAA GGCGGCATCA AGGGGCTGCC CCTACCGCCA
ACCCTGAATG CCGAGCTTCG CCCCTACCAG TTGGAGGGTT TCAACTGGCT GAGTTTCCTC
TACCGGCACG GCTTGGGCGG GGTGCTGGCC GATGACATGG GCCTGGGCAA GACCGTCCAG
GCCCTCGCGC TGGTTTGCGC GGCTAAGGAA GCCGCCGCGT CTGAGGGGAA AGCGGCCGAC
GGCGCCGCGC CCTTTCTCGT CGTGGCCCCC ACCAGCGTGG TGGGCAACTG GGAGGCCGAA
ACCGCGCGGT TCGCCCCCGG GCTTACCGTC CGCGCCATCG GCGAGACGTT CGCGAAAAAC
GGGGTGGACC CGGCCGAGGC CATGGCCGGC GCGGACATCG TGATTACGTC CTACGCCCTG
TTCCGGATCG ACTACGAAGC GTATGCGTCA CGGGAATGGG CCGCCCTGGT GCTGGACGAG
GCGCAGTTCG TGAAGAACCA CCAGTCCAAG GCCTACCAGT GCGCGCGCAA GCTGCCGGCG
GCGTTCAAAC TGGCCATCAC CGGAACACCG CTGGAAAACA ACCTGATGGA GTTCTGGGCG
CTCACCTCGA TCGTGGCGCC GGGCCTGTTC GCGAGCCCCA GCCGGTTCGC CGAGTACTAC
CAGAAGCCGG TGGAAAAGAA CGGCGACAAG GGGCAGTTGG ACAAGCTCCG CCGTCGGGTC
CGTCCACTCA TGATGCGGCG CACCAAGGAA CAGGTCATCC ACGACCTGCC GCCCAAGCAG
GAGCAGATCC TCGAGGTGGT GCTGAATCCG CGGCACCAGA AGGTCTACCA GACGCACCTG
CAGCGGGAGC GCCAGAAGAT CCTGGGCCTC ATTGAGGACG TCAACAAAAA CCGCTTCACC
ATTTTCCAGT CGCTCACCCT GCTCCGTCAG CTCAGCCTGG ACGCATCGCT GGTGGACCCG
TCCCTGTCCG GCGTGCGGTC CAGCAAGCTG GACGTGCTCT TCGAGCAGCT TGAGGACCTC
GTGGCCGAAG GGCACCGTGC CTTGATCTTC AGCCAGTTCA CCGGCTTCCT GGGCAAGGTC
CGTGAACGGC TCGTCGAGGA GAAGATCGAA TTCTGCTACC TCGACGGCAG CACCCGCAAC
CGCTCCGACG TGGTGAACGA GTTCAAGAAC GGCTCCGCCC CCGTGTTCCT GATCAGCCTG
AAGGCCGGCG GCTTCGGCCT GAACCTGACA GAGGCGGACT ACGTGTTCCT GCTGGACCCC
TGGTGGAACC CGGCGTCCGA GGCGCAGGCG GTGGACCGTA CGCACCGGAT CGGCCAGGCC
CGCAACGTGA TGGTCTACCG GCTGGTTGCC AAGGACACCA TCGAGGAAAA GGTCATGGCC
TTGAAGGCCA AGAAGTCGCA GCTGTTCGCG GACGTCATGG AGGGCGACGC GCTCTCCGGC
GGTTCCCTGA CGGCCGATGA CTTGGCCGGC CTGTTCAACG ACTGA
 
Protein sequence
MPSQPDESAA LAIQTPAIND RSLAAGLAYA MGSRVSGISF DAATGLMLGK VRGGSDAPYS 
TTAKLVRKAG GWSCTVGVCS CPVRKDCKHV AALLFAAEDS PAIRVQLLAP AEATRLSREP
LAQDMPDWEQ ALGPLIARPG ITPSTNGIPL ALQFDIEEPA PHFSYTGRRD PLRSVRQLKA
RPVIMGAKGK WIRGDVSWNT LSYLNYRREC NEAHVEWMQE FLASHTALAN RQHNSTAPWL
GLNTYAGKNL WSLLAQARKI GVALVHAGSQ EPVRFVEEPA AVGLNLTRFG ASGGETGGGA
AVERAAVGPA AAVNAKTSDD GGLTLAPTMT VEGEVVDPAS VGTIGRPAHG IFLTSGADAL
PGTGNGSIIT LAPLESGLSE ELLTFVTAGS TLHIPARDET RFLTGFYPKL KQAARVTASD
QSVELPTLAV PTLSLLANYG ADHKVRLHWE WHYTSGNLVT AQPLWRHPGD HGYRDDPTEA
RILEAVGQPW EVVPKLGESA TGGWGTPRLA ASAELSGLDT LAFTEEVLPR LRNTPDVVVD
TSGEIADYRE AEEAPVVAIS TKATDQRDWF DLGIQISLEG QPVSFAAVFS ALAAGQTKML
LPSGAYFSLD LPELHQLRAL IEEARSLQDN KDAPLQISRF QAGLWDELAH LGVVDEQAAA
WREAVGGLLE GGIKGLPLPP TLNAELRPYQ LEGFNWLSFL YRHGLGGVLA DDMGLGKTVQ
ALALVCAAKE AAASEGKAAD GAAPFLVVAP TSVVGNWEAE TARFAPGLTV RAIGETFAKN
GVDPAEAMAG ADIVITSYAL FRIDYEAYAS REWAALVLDE AQFVKNHQSK AYQCARKLPA
AFKLAITGTP LENNLMEFWA LTSIVAPGLF ASPSRFAEYY QKPVEKNGDK GQLDKLRRRV
RPLMMRRTKE QVIHDLPPKQ EQILEVVLNP RHQKVYQTHL QRERQKILGL IEDVNKNRFT
IFQSLTLLRQ LSLDASLVDP SLSGVRSSKL DVLFEQLEDL VAEGHRALIF SQFTGFLGKV
RERLVEEKIE FCYLDGSTRN RSDVVNEFKN GSAPVFLISL KAGGFGLNLT EADYVFLLDP
WWNPASEAQA VDRTHRIGQA RNVMVYRLVA KDTIEEKVMA LKAKKSQLFA DVMEGDALSG
GSLTADDLAG LFND
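
As a spot check that the protein sequence corresponds to the gene sequence, the first and last lines of the gene sequence can be translated and compared with the start and end of the protein sequence. This is a minimal sketch, not the tool used to produce this record: it builds the codon table from the standard NCBI amino-acid string (translation table 11 uses the same codon assignments as the standard code and differs only in permitted start codons), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (the 10-base grouping used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGCCATCCC AACCGGACGA GAGTGCGGCA CTGGCAATAC AGACCCCCGC GATCAACGAC"
last_line = "GGTTCCCTGA CGGCCGATGA CTTGGCCGGC CTGTTCAACG ACTGA"

print(translate(first_line))  # MPSQPDESAALAIQTPAIND
print(translate(last_line))   # GSLTADDLAGLFND
```

Both outputs match the corresponding ends of the protein sequence. The last line can be translated on its own only because the preceding 57 lines hold 60 bases each, so it begins on a codon boundary.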