Gene Arth_2063 details


Gene Information

Locus tag: Arth_2063
Symbol:
ID: 4445402
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2323112
End bp: 2325979
Gene Length: 2868 bp
Protein Length: 955 aa
Translation table: 11
GC content: 64%
IMG OID: 639689871
Product: DNA polymerase I
Protein accession: YP_831543
Protein GI: 116670610
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
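
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2323112
END_BP = 2325979
GENE_LENGTH_BP = 2868
PROTEIN_LENGTH_AA = 955


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Amino-acid count, assuming the gene ends with exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 2868
print(coding_codons(GENE_LENGTH_BP))   # 955
```

Under these assumptions both values agree with the listed gene length and protein length.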


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.045949
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAACTA GGCTGGCACC TGTGAGCGAA ACTACCAAAC CGGCCCTTGC CCCCCTTTCG 
GCAGACACTG CTGCAGACAT CCTCCCGGCA GCTGAGGCCG AATCAGCGAC TGCGGCCAAA
CCCGCCCGGA AAACCACACG CACCGCGGAG CCGGTTTCCG CCACCGAGGC CCCCGTGATT
CCCATCACCG ACCAGCCGCG CCTCCTGGTG CTGGACGGGC ACTCGATGGC CTTCCGGGCG
TTCTTCGCGC TGCCCGCGGA CAAGTTTTCC ACCGCCGCGG GGCAGCACAC CAACGCGATC
CACGGTTTCA CGTCCATGCT CATCAACCTG ATCAAGGAAC AGAAGCCCAC GCACGTAGCT
GTGGCGTTCG ACGTCTCTGA CGAGACCACC CACCGGAAGG CCGAGTACAG CGAGTACAAG
GGCGGCCGCA ACGAAACTCC CCGTGAGATG AGCGGCCAGA TCGACCTTAT TGACAAGGTC
ATGGGCGCGT GGGGCATCAA GACCATCAAG ATGCCCGGCT ACGAAGCGGA TGACGTCCTG
GCTACGCTCG CCGCCATGGG GGAAAAGGCA GGCTACGAAG TGCTGCTGGT GACGGGCGAC
CGCGACGCCT TCCAGCTGAT TACGGACAAC GTCTTTGTGC TCTACCCCAG GAAGGGCGTC
AGCGACATTC CCCGCCTGGA TGCCCCCGCC ATCCAGGAAA AGTACTTTGT CAGCCCCGCC
CAGTATTCGG ACCTGGCCGC CCTGGTGGGC GAGTCTGCGG ACAACCTTCC CGGCGTGCCC
GGCGTTGGTC CCAAGACGGC CGCCAAGTGG ATCAACCTCT ACGGCGGCCT GGAAGGCGTG
CTGGAACACC TCGACAAGAT CGGCGGCAAG GTGGGCGACT CCCTCCGCGA AAACGTGGAC
GCCGTGAAGC GGAACCGCCG GTTGAACAGG CTCCACACCG ACCTCGAGCT TCCCGTGACC
CTGGACGACC TCGCCGACCC CCGGCCGGAC CAGGCGGCGC TGGAACAGCT GTTCGACGAG
CTGGAGTTTA AGACCATCCG CACCCGGCTC TTTGCCCTCT ACGGCGCCGA AGAGGTGGAC
CTCGCCGAGC GTGAGAGCCT CGACATCCCG GATTACAGCA CGCCCGCCGG AGCAGCCGAG
CTGAGCGCAT TCCTGGCTGC GGGCGCCGGT CAGCGGTCCG CCGTCGCCGT CGACCTTGTT
CCCGGACGCA TCGGCGAGGA TGCCGCCGCG CTGGCAATCG TCCGGGACGG AGCCGCGGTA
TACATCGACC TTTCCGGCCA GGATGCCGAG GCCGAAAACG TCCTGGCGGC CTGGCTGCGC
GACCCGGAAG CGCCCAAGGT CATGCACGGC TTCAAGGCCG CCCTCAAGGC CCTGAGCGCC
CGCGGACTGG AACTGGAAGG CGTCGTCGAC GATACGTCGA TTTCCGGTTA CCTCATCCAG
CCCGACCGCC GCACTTACGA GCTCGCGGAG CTGGCGCAGC ACCACCTCAA CATCGAAATC
TCCACCGCCG TGGCGAAGGC CGGGCAGCTG GAATTGTCGT TCGACGGCGA TGACTCCGCC
GCCGCCGGTG AACTCGTCCA CGCTGCCGCC GTCGTCCACG CCCTCAGCCG CTACTTCGAG
GCGGAACTGA AGGAGCGCAA GGCCGAGGAG CTGCTGTCCA CGCTGGAACT TCCGGTGAGC
CAGGTACTGG CGGATATGGA ACTCGCCGGA ATCGCCATCG ACATGCAGCG GATGGATGAG
CAGCTGGCCG ACCTTGCCAA GGTGATCGAC AACGCCCAGG AACTTGCCTT CGCCGCCATC
GGACACGAGG TCAACCTCGG ATCACCCAAA CAGCTGCAGA CCGTGCTGTT CGACGAACTC
CAGCTGCCCA AGACCAAGAA GATCAAGTCC GGTTACACCA CCGACGCCGC ATCGCTCAAG
AACCTCCTGG AAAAGACCGG GCACGAATTC CTGGTCCAGC TGATGGCGCA CCGGGAATCC
TCGAAACTCC GCCAGATGCT GGAGTCGCTG AAGAAGTCCG TCGCCGAGGA CGGCCGCATC
CACACCACCT ATGCGCAAAA TGTCGCAGCC ACCGGCCGTA TCTCGTCCAA CAACCCCAAC
CTGCAGAACA TCCCCATCCG GAGCGAAGAG GGCCGGCGCG TCCGTGGCAT CTTCGTGGTC
AGCGAGGGCT ATGAATGCCT CCTTTCCGCG GACTATTCGC AGATCGAGAT GCGGATCATG
GCCCACCTCT CGGGGGACGC CGGCCTGATC CAGGCCTACC GGGACGGCGA AGACCTTCAC
CGGTTTGTGG GATCGAACAT CTTCCACGTG CCCACCGACC AGGTCACAAG TGCCATGCGG
TCCAAGGTCA AGGCGATGTC CTACGGCCTG GCCTACGGCC TGACCTCGTT CGGACTGTCC
AAGCAGCTGG AAATTTCTGT TGACGAGGCC CGGACATTGA TGAAGGAATA CTTCGACCGC
TTCGGAGCCG TGCGCGACTA CCTCCGCGGC GTGGTGGACC AGGCCCGGAT CGACGGCTTC
ACGGCCACCA TCGAGGGGCG CCGCCGTTAC CTGCCGGACC TCACCAGCAC GGACCGCCAG
CTGCGCGAGA ACGCGGAACG CATTGCGCTC AACTCACCCA TCCAGGGTTC CGCGGCGGAC
ATCATCAAAC GGGCCATGCT GGGCGTGCAT GCTGAATTGA AGGCCCAGGG CCTCAAATCA
CGGATGCTCC TGCAGGTCCA TGACGAACTG GTGCTTGAAG TTGCCGCCGG TGAACGGGAA
GCGGTGGAAA AGCTGGTGAC GGAGCAGATG GGCTCCGCCG CGGACCTCAG CGTGCCGCTG
GACGTCCAGA TCGGCGTCGG GCCCAGCTGG TACGACGCCG GTCACTAA
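
The sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way a GC fraction such as the "GC content" field could be derived from the listing. The helper names are hypothetical and the database's exact method is not stated here. The example input is only the first line of the gene sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(text):
    """Strip spaces and line breaks from a block-formatted sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of bases that are G or C (plain count, no ambiguity codes)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, spacing kept as printed.
first_line = "ATGGGAACTA GGCTGGCACC TGTGAGCGAA ACTACCAAAC CGGCCCTTGC CCCCCTTTCG"
print(len(clean_sequence(first_line)))
print(gc_fraction(first_line))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the 64% listed in the record.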
 
Protein sequence
MGTRLAPVSE TTKPALAPLS ADTAADILPA AEAESATAAK PARKTTRTAE PVSATEAPVI 
PITDQPRLLV LDGHSMAFRA FFALPADKFS TAAGQHTNAI HGFTSMLINL IKEQKPTHVA
VAFDVSDETT HRKAEYSEYK GGRNETPREM SGQIDLIDKV MGAWGIKTIK MPGYEADDVL
ATLAAMGEKA GYEVLLVTGD RDAFQLITDN VFVLYPRKGV SDIPRLDAPA IQEKYFVSPA
QYSDLAALVG ESADNLPGVP GVGPKTAAKW INLYGGLEGV LEHLDKIGGK VGDSLRENVD
AVKRNRRLNR LHTDLELPVT LDDLADPRPD QAALEQLFDE LEFKTIRTRL FALYGAEEVD
LAERESLDIP DYSTPAGAAE LSAFLAAGAG QRSAVAVDLV PGRIGEDAAA LAIVRDGAAV
YIDLSGQDAE AENVLAAWLR DPEAPKVMHG FKAALKALSA RGLELEGVVD DTSISGYLIQ
PDRRTYELAE LAQHHLNIEI STAVAKAGQL ELSFDGDDSA AAGELVHAAA VVHALSRYFE
AELKERKAEE LLSTLELPVS QVLADMELAG IAIDMQRMDE QLADLAKVID NAQELAFAAI
GHEVNLGSPK QLQTVLFDEL QLPKTKKIKS GYTTDAASLK NLLEKTGHEF LVQLMAHRES
SKLRQMLESL KKSVAEDGRI HTTYAQNVAA TGRISSNNPN LQNIPIRSEE GRRVRGIFVV
SEGYECLLSA DYSQIEMRIM AHLSGDAGLI QAYRDGEDLH RFVGSNIFHV PTDQVTSAMR
SKVKAMSYGL AYGLTSFGLS KQLEISVDEA RTLMKEYFDR FGAVRDYLRG VVDQARIDGF
TATIEGRRRY LPDLTSTDRQ LRENAERIAL NSPIQGSAAD IIKRAMLGVH AELKAQGLKS
RMLLQVHDEL VLEVAAGERE AVEKLVTEQM GSAADLSVPL DVQIGVGPSW YDAGH
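
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard table and differ only in the permitted start codons. It does not handle alternative start codons, which is not a concern here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGGAACTA GGCTGGCACC TGTGAGCGAA"))  # MGTRLAPVSE
print(translate("GGTCACTAA"))                         # GH*
```

Both results match the start and the end of the protein sequence listed above, with the final `*` being the stop codon that is not part of the 955-residue protein.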