Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2573 |
Symbol | |
ID | 4444899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2887484 |
End bp | 2890555 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639690392 |
Product | SMC domain-containing protein |
Protein accession | YP_832052 |
Protein GI | 116671119 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGGATCC ACCGGCTCGA GATATCCGCC TTCGGCCCCT TCGCAGGCAC CGAGCACATC GACTTTGACC GACTCAGCGC GCACGGGCTC TTCCTGCTGA ACGGCGCAAC CGGCGCCGGC AAGACCAGCG TGCTGGATGC CATCTGCTTT GCCCTGTACG GGTCCGTGCC CGGTGCACGC CAGGAGGGCA AGCGCCTCCG CAGCGACCAC GCTGACGCCG CCGCGGAACC GCGCGTCACC TGCGAGTTTT CAGCCAAGGG GCGGCATTTT GAAGTCTCCA GGATTCCTGC GTGGAACAGG CCCAGCGCCA GGGGCCGGAA CGGATTTACT GAACAGAAGG CCAACACCCT GCTGCGCGAA CGCGTTGACG GGCAGTGGAT CGAGAAGTCC GGCCGGAACG ATGAAGCCGG CGCGGAAATC AGCTCCGTGC TGGGCATGGA CCGTGAGCAG TTCACCCGGG TGGTCATGCT GCCGCAGGGT GACTTCGCCG CTTTTCTCCG CTCCAAACCG GCCGAGCGGC TGGAACTGCT CCAAAGCCTG TTCGGCACGG AGCGTTTTGA GGCCGTGGAA CAGGAGCTGG CCCGGCGTGC CGCGGATGCC CGCGCACAGG TGGCCAGCCT CAACAGCCAG TTGGACCTCC TGCTTGCACA GGCCAGGTCT GAAGTAACTC CGCCGGAACA GGAACTGCCG GATGTCCCGG CAGCACCGGA GGATGCAGAC CTCCTTCTCG AATGGCTGCA GGACACCGCC GCGGCCAGGG CCGTGACAGC GCATGCTGAA GCGGACGAAG CGGCCGCAGG ACGCGCCGGG GCTGCCCGCC GCCTTGAAGC CGCCGAAGCG CATGCTGCCC GCCAGGTCAA ACTCGCTGCG GCGGAACGCC GGAGATCGGC TGCAGACGCC GCAGCGCCTG AACTCCGGGA CAAGGCACGG CAACTTGGCC TGCACCGCAA GGCCGAGGTA CTGGGCGGCC AATTGCAGGC CCTGGACAAG GCCGACATCG CTGAAGAACG GGCCGCCCGG GCTATGGCAG CGGCGGTCGA CGAACTGCGC GCTGCGGTCC TCGTGGACGC CGAACTCGCA GCCCTGTCCG CTTACGCCCG CCAGGAAAGC GGCAGCGCGG ACGAGCACTT CTTCGATGCT GCAGTAGTAC GGAGTGAACT CAGCCGCCTG CGGTCCCTCC GCGCGGTGCT TGAGGAACGG CTGCCGGACG AGGCCAGGCT GTCCGGGATG GTGGCCCGCG GCGCGGAACT TCGGAAAACC CTCACCGAGT TGCGTGAGAG AAGGCGGGCC GGCGCTGCCG CCCTTGAAGG TTTGCGGGCG GAGGCCGCCG AACTGCTAGC GGGCGTGAAG CCCCTCGAGG AACTCGCTGC CGAAGCCCAG CTGCGGACCA AGGAGGCCGC AGCTGCGGAG GAACTCGTCG CCGTCGTCGG CCGCCACGCT GCAGCGGTCC GGGTCAGTTC CGGCGTTGCC GAACGGCACC GCCTGGCCCG GGACGACCAC CAGAACCACC GCCAGCGGTG GCTGGACCTG AGGGAGGAGC GGCTCGCCAA TGCAGCTGCG GAGCTTGCTT CCCAGCTGCG GCCGACCGAA CCCTGCCCCG TGTGCGGCAG CCCCGAGCAC CCTTCTCCGG CTCCGGCGGC CACCGCCGCG TTGGCCGTTG CTGACGCGGA ACGCGCGGCC CAGGAAGCCT GCGAGGCTGC GGAAGCGGTC CTGGCGGCGC TGGGCAAAGA ACTGGCCGAA GCGCGGCAGC TGGTCGCCGT GCTGGCGGCC CAGGGCGGTG ATCTCCCGCT GGAGGAAGCC CGCGCAGACG CGGCCCAGGC GAAGGAGCGA GCGGACGAGG CAGTCAGGGC GGCCGCGGAC CTCGCCGCCA GCCGTGAGCG CCAGGCAGAA CTGGACGAAC ACATCGATGC TGCCGAATCG GCCCAGGCCG CTGCCGATTC CGGAATGGCG AAGACTGAAT CCACCCTCAT GGAAGTCCTG GAACAGACGG ACGCCCTGGA TGATGCGCTG GGCAAACTGC GTGCGGGCTA CCCGACCCTC GGCAGCCGCC TGAGCTCCCT CGACGAATCC ACGGCGCTCC TGGAACGAAC AGACGCCGCG AGGAGCGGCC TCGAACAGGC CGGGCTGCGC ACCAGGGATG CCCGCCAGCA GTTGGACAAG GCGCTTCCGG AGTCCGGGTT CGAATCAGCC GCGGCGGCAC GGTCCGTCCT GCTCCCGGTC CCGGAAGCAG CCATGCTCGA AGCCGCGATC CGGGCCGGCC AAGACGAAGA AGCCCGCGTC GGGGAACTCT TCGCCAGCGA GGAACTGATC CTTGCCACAC GCGAGCTTGA GGACGACGGC CCGGTAGAGG CCAGCGTGCT GGAACAGCTT CGCGCGGAGG ACGCTGCCGC TGAACGAATG GCCAGGCAAG CCGTTGTCGC GGCGGGGCTC GCCGAGAAGT CAGTCCTTAC CCTCCGTCGG ATCGCGGAGG ACTACGGCCG GCTGGCGGCA TCGGGCCAAG GTCCGCGGGA ACGTGCCGCG CTGCTCACGG CGGTGGCCGA GGCCGCCCGC GGCGCCGGCG ACAACACCTA CCGCATGAGC CTGAACAGCT ACGTACTCGC GGCGAGGCTC GAGCAAGTGG CCATTGCCGC TTCGGAGAGG CTGGTCGGCA TGAGCGATGG CCGGTACACC CTGCAGCACA CGGACGCCAA GGCCGCCCGC GGTGCCAAAT CCGGTCTTGG CCTGGAAGTC GTGGACCAGT GGACCGGTCA CCGCCGGGAT ACCGCCACGC TGTCCGGCGG TGAATCTTTC ATGGCTTCCC TGGCCCTGGC GCTGGGTCTG GCGGATGTGG TGCAACAGGA GTCCGGCGGA GTGGACATCG AGACACTCTT CGTGGACGAG GGCTTCGGCA GCCTCGACGA GCAGGCGTTG GAACAAGTGA TGGATGCCCT TGAGGGGCTT CGGGACGGCG GCCGTGTGGT CGGCCTGGTG AGCCATGTGC CCGAGATGAA GCAGCGCATC AGCACCCAGC TCCAGGTGGT CAAGGGGCGG AACGGTTCCA CTCTCCATAT TTCGGACGAC GCCCTGGCCT GA
|
Protein sequence | MRIHRLEISA FGPFAGTEHI DFDRLSAHGL FLLNGATGAG KTSVLDAICF ALYGSVPGAR QEGKRLRSDH ADAAAEPRVT CEFSAKGRHF EVSRIPAWNR PSARGRNGFT EQKANTLLRE RVDGQWIEKS GRNDEAGAEI SSVLGMDREQ FTRVVMLPQG DFAAFLRSKP AERLELLQSL FGTERFEAVE QELARRAADA RAQVASLNSQ LDLLLAQARS EVTPPEQELP DVPAAPEDAD LLLEWLQDTA AARAVTAHAE ADEAAAGRAG AARRLEAAEA HAARQVKLAA AERRRSAADA AAPELRDKAR QLGLHRKAEV LGGQLQALDK ADIAEERAAR AMAAAVDELR AAVLVDAELA ALSAYARQES GSADEHFFDA AVVRSELSRL RSLRAVLEER LPDEARLSGM VARGAELRKT LTELRERRRA GAAALEGLRA EAAELLAGVK PLEELAAEAQ LRTKEAAAAE ELVAVVGRHA AAVRVSSGVA ERHRLARDDH QNHRQRWLDL REERLANAAA ELASQLRPTE PCPVCGSPEH PSPAPAATAA LAVADAERAA QEACEAAEAV LAALGKELAE ARQLVAVLAA QGGDLPLEEA RADAAQAKER ADEAVRAAAD LAASRERQAE LDEHIDAAES AQAAADSGMA KTESTLMEVL EQTDALDDAL GKLRAGYPTL GSRLSSLDES TALLERTDAA RSGLEQAGLR TRDARQQLDK ALPESGFESA AAARSVLLPV PEAAMLEAAI RAGQDEEARV GELFASEELI LATRELEDDG PVEASVLEQL RAEDAAAERM ARQAVVAAGL AEKSVLTLRR IAEDYGRLAA SGQGPRERAA LLTAVAEAAR GAGDNTYRMS LNSYVLAARL EQVAIAASER LVGMSDGRYT LQHTDAKAAR GAKSGLGLEV VDQWTGHRRD TATLSGGESF MASLALALGL ADVVQQESGG VDIETLFVDE GFGSLDEQAL EQVMDALEGL RDGGRVVGLV SHVPEMKQRI STQLQVVKGR NGSTLHISDD ALA
|
| |