Gene Arth_2573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2573 
Symbol 
ID4444899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2887484 
End bp2890555 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content69% 
IMG OID639690392 
ProductSMC domain-containing protein 
Protein accessionYP_832052 
Protein GI116671119 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGATCC ACCGGCTCGA GATATCCGCC TTCGGCCCCT TCGCAGGCAC CGAGCACATC 
GACTTTGACC GACTCAGCGC GCACGGGCTC TTCCTGCTGA ACGGCGCAAC CGGCGCCGGC
AAGACCAGCG TGCTGGATGC CATCTGCTTT GCCCTGTACG GGTCCGTGCC CGGTGCACGC
CAGGAGGGCA AGCGCCTCCG CAGCGACCAC GCTGACGCCG CCGCGGAACC GCGCGTCACC
TGCGAGTTTT CAGCCAAGGG GCGGCATTTT GAAGTCTCCA GGATTCCTGC GTGGAACAGG
CCCAGCGCCA GGGGCCGGAA CGGATTTACT GAACAGAAGG CCAACACCCT GCTGCGCGAA
CGCGTTGACG GGCAGTGGAT CGAGAAGTCC GGCCGGAACG ATGAAGCCGG CGCGGAAATC
AGCTCCGTGC TGGGCATGGA CCGTGAGCAG TTCACCCGGG TGGTCATGCT GCCGCAGGGT
GACTTCGCCG CTTTTCTCCG CTCCAAACCG GCCGAGCGGC TGGAACTGCT CCAAAGCCTG
TTCGGCACGG AGCGTTTTGA GGCCGTGGAA CAGGAGCTGG CCCGGCGTGC CGCGGATGCC
CGCGCACAGG TGGCCAGCCT CAACAGCCAG TTGGACCTCC TGCTTGCACA GGCCAGGTCT
GAAGTAACTC CGCCGGAACA GGAACTGCCG GATGTCCCGG CAGCACCGGA GGATGCAGAC
CTCCTTCTCG AATGGCTGCA GGACACCGCC GCGGCCAGGG CCGTGACAGC GCATGCTGAA
GCGGACGAAG CGGCCGCAGG ACGCGCCGGG GCTGCCCGCC GCCTTGAAGC CGCCGAAGCG
CATGCTGCCC GCCAGGTCAA ACTCGCTGCG GCGGAACGCC GGAGATCGGC TGCAGACGCC
GCAGCGCCTG AACTCCGGGA CAAGGCACGG CAACTTGGCC TGCACCGCAA GGCCGAGGTA
CTGGGCGGCC AATTGCAGGC CCTGGACAAG GCCGACATCG CTGAAGAACG GGCCGCCCGG
GCTATGGCAG CGGCGGTCGA CGAACTGCGC GCTGCGGTCC TCGTGGACGC CGAACTCGCA
GCCCTGTCCG CTTACGCCCG CCAGGAAAGC GGCAGCGCGG ACGAGCACTT CTTCGATGCT
GCAGTAGTAC GGAGTGAACT CAGCCGCCTG CGGTCCCTCC GCGCGGTGCT TGAGGAACGG
CTGCCGGACG AGGCCAGGCT GTCCGGGATG GTGGCCCGCG GCGCGGAACT TCGGAAAACC
CTCACCGAGT TGCGTGAGAG AAGGCGGGCC GGCGCTGCCG CCCTTGAAGG TTTGCGGGCG
GAGGCCGCCG AACTGCTAGC GGGCGTGAAG CCCCTCGAGG AACTCGCTGC CGAAGCCCAG
CTGCGGACCA AGGAGGCCGC AGCTGCGGAG GAACTCGTCG CCGTCGTCGG CCGCCACGCT
GCAGCGGTCC GGGTCAGTTC CGGCGTTGCC GAACGGCACC GCCTGGCCCG GGACGACCAC
CAGAACCACC GCCAGCGGTG GCTGGACCTG AGGGAGGAGC GGCTCGCCAA TGCAGCTGCG
GAGCTTGCTT CCCAGCTGCG GCCGACCGAA CCCTGCCCCG TGTGCGGCAG CCCCGAGCAC
CCTTCTCCGG CTCCGGCGGC CACCGCCGCG TTGGCCGTTG CTGACGCGGA ACGCGCGGCC
CAGGAAGCCT GCGAGGCTGC GGAAGCGGTC CTGGCGGCGC TGGGCAAAGA ACTGGCCGAA
GCGCGGCAGC TGGTCGCCGT GCTGGCGGCC CAGGGCGGTG ATCTCCCGCT GGAGGAAGCC
CGCGCAGACG CGGCCCAGGC GAAGGAGCGA GCGGACGAGG CAGTCAGGGC GGCCGCGGAC
CTCGCCGCCA GCCGTGAGCG CCAGGCAGAA CTGGACGAAC ACATCGATGC TGCCGAATCG
GCCCAGGCCG CTGCCGATTC CGGAATGGCG AAGACTGAAT CCACCCTCAT GGAAGTCCTG
GAACAGACGG ACGCCCTGGA TGATGCGCTG GGCAAACTGC GTGCGGGCTA CCCGACCCTC
GGCAGCCGCC TGAGCTCCCT CGACGAATCC ACGGCGCTCC TGGAACGAAC AGACGCCGCG
AGGAGCGGCC TCGAACAGGC CGGGCTGCGC ACCAGGGATG CCCGCCAGCA GTTGGACAAG
GCGCTTCCGG AGTCCGGGTT CGAATCAGCC GCGGCGGCAC GGTCCGTCCT GCTCCCGGTC
CCGGAAGCAG CCATGCTCGA AGCCGCGATC CGGGCCGGCC AAGACGAAGA AGCCCGCGTC
GGGGAACTCT TCGCCAGCGA GGAACTGATC CTTGCCACAC GCGAGCTTGA GGACGACGGC
CCGGTAGAGG CCAGCGTGCT GGAACAGCTT CGCGCGGAGG ACGCTGCCGC TGAACGAATG
GCCAGGCAAG CCGTTGTCGC GGCGGGGCTC GCCGAGAAGT CAGTCCTTAC CCTCCGTCGG
ATCGCGGAGG ACTACGGCCG GCTGGCGGCA TCGGGCCAAG GTCCGCGGGA ACGTGCCGCG
CTGCTCACGG CGGTGGCCGA GGCCGCCCGC GGCGCCGGCG ACAACACCTA CCGCATGAGC
CTGAACAGCT ACGTACTCGC GGCGAGGCTC GAGCAAGTGG CCATTGCCGC TTCGGAGAGG
CTGGTCGGCA TGAGCGATGG CCGGTACACC CTGCAGCACA CGGACGCCAA GGCCGCCCGC
GGTGCCAAAT CCGGTCTTGG CCTGGAAGTC GTGGACCAGT GGACCGGTCA CCGCCGGGAT
ACCGCCACGC TGTCCGGCGG TGAATCTTTC ATGGCTTCCC TGGCCCTGGC GCTGGGTCTG
GCGGATGTGG TGCAACAGGA GTCCGGCGGA GTGGACATCG AGACACTCTT CGTGGACGAG
GGCTTCGGCA GCCTCGACGA GCAGGCGTTG GAACAAGTGA TGGATGCCCT TGAGGGGCTT
CGGGACGGCG GCCGTGTGGT CGGCCTGGTG AGCCATGTGC CCGAGATGAA GCAGCGCATC
AGCACCCAGC TCCAGGTGGT CAAGGGGCGG AACGGTTCCA CTCTCCATAT TTCGGACGAC
GCCCTGGCCT GA
 
Protein sequence
MRIHRLEISA FGPFAGTEHI DFDRLSAHGL FLLNGATGAG KTSVLDAICF ALYGSVPGAR 
QEGKRLRSDH ADAAAEPRVT CEFSAKGRHF EVSRIPAWNR PSARGRNGFT EQKANTLLRE
RVDGQWIEKS GRNDEAGAEI SSVLGMDREQ FTRVVMLPQG DFAAFLRSKP AERLELLQSL
FGTERFEAVE QELARRAADA RAQVASLNSQ LDLLLAQARS EVTPPEQELP DVPAAPEDAD
LLLEWLQDTA AARAVTAHAE ADEAAAGRAG AARRLEAAEA HAARQVKLAA AERRRSAADA
AAPELRDKAR QLGLHRKAEV LGGQLQALDK ADIAEERAAR AMAAAVDELR AAVLVDAELA
ALSAYARQES GSADEHFFDA AVVRSELSRL RSLRAVLEER LPDEARLSGM VARGAELRKT
LTELRERRRA GAAALEGLRA EAAELLAGVK PLEELAAEAQ LRTKEAAAAE ELVAVVGRHA
AAVRVSSGVA ERHRLARDDH QNHRQRWLDL REERLANAAA ELASQLRPTE PCPVCGSPEH
PSPAPAATAA LAVADAERAA QEACEAAEAV LAALGKELAE ARQLVAVLAA QGGDLPLEEA
RADAAQAKER ADEAVRAAAD LAASRERQAE LDEHIDAAES AQAAADSGMA KTESTLMEVL
EQTDALDDAL GKLRAGYPTL GSRLSSLDES TALLERTDAA RSGLEQAGLR TRDARQQLDK
ALPESGFESA AAARSVLLPV PEAAMLEAAI RAGQDEEARV GELFASEELI LATRELEDDG
PVEASVLEQL RAEDAAAERM ARQAVVAAGL AEKSVLTLRR IAEDYGRLAA SGQGPRERAA
LLTAVAEAAR GAGDNTYRMS LNSYVLAARL EQVAIAASER LVGMSDGRYT LQHTDAKAAR
GAKSGLGLEV VDQWTGHRRD TATLSGGESF MASLALALGL ADVVQQESGG VDIETLFVDE
GFGSLDEQAL EQVMDALEGL RDGGRVVGLV SHVPEMKQRI STQLQVVKGR NGSTLHISDD
ALA