Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2063 |
Symbol | |
ID | 4445402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2323112 |
End bp | 2325979 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639689871 |
Product | DNA polymerase I |
Protein accession | YP_831543 |
Protein GI | 116670610 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.045949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAACTA GGCTGGCACC TGTGAGCGAA ACTACCAAAC CGGCCCTTGC CCCCCTTTCG GCAGACACTG CTGCAGACAT CCTCCCGGCA GCTGAGGCCG AATCAGCGAC TGCGGCCAAA CCCGCCCGGA AAACCACACG CACCGCGGAG CCGGTTTCCG CCACCGAGGC CCCCGTGATT CCCATCACCG ACCAGCCGCG CCTCCTGGTG CTGGACGGGC ACTCGATGGC CTTCCGGGCG TTCTTCGCGC TGCCCGCGGA CAAGTTTTCC ACCGCCGCGG GGCAGCACAC CAACGCGATC CACGGTTTCA CGTCCATGCT CATCAACCTG ATCAAGGAAC AGAAGCCCAC GCACGTAGCT GTGGCGTTCG ACGTCTCTGA CGAGACCACC CACCGGAAGG CCGAGTACAG CGAGTACAAG GGCGGCCGCA ACGAAACTCC CCGTGAGATG AGCGGCCAGA TCGACCTTAT TGACAAGGTC ATGGGCGCGT GGGGCATCAA GACCATCAAG ATGCCCGGCT ACGAAGCGGA TGACGTCCTG GCTACGCTCG CCGCCATGGG GGAAAAGGCA GGCTACGAAG TGCTGCTGGT GACGGGCGAC CGCGACGCCT TCCAGCTGAT TACGGACAAC GTCTTTGTGC TCTACCCCAG GAAGGGCGTC AGCGACATTC CCCGCCTGGA TGCCCCCGCC ATCCAGGAAA AGTACTTTGT CAGCCCCGCC CAGTATTCGG ACCTGGCCGC CCTGGTGGGC GAGTCTGCGG ACAACCTTCC CGGCGTGCCC GGCGTTGGTC CCAAGACGGC CGCCAAGTGG ATCAACCTCT ACGGCGGCCT GGAAGGCGTG CTGGAACACC TCGACAAGAT CGGCGGCAAG GTGGGCGACT CCCTCCGCGA AAACGTGGAC GCCGTGAAGC GGAACCGCCG GTTGAACAGG CTCCACACCG ACCTCGAGCT TCCCGTGACC CTGGACGACC TCGCCGACCC CCGGCCGGAC CAGGCGGCGC TGGAACAGCT GTTCGACGAG CTGGAGTTTA AGACCATCCG CACCCGGCTC TTTGCCCTCT ACGGCGCCGA AGAGGTGGAC CTCGCCGAGC GTGAGAGCCT CGACATCCCG GATTACAGCA CGCCCGCCGG AGCAGCCGAG CTGAGCGCAT TCCTGGCTGC GGGCGCCGGT CAGCGGTCCG CCGTCGCCGT CGACCTTGTT CCCGGACGCA TCGGCGAGGA TGCCGCCGCG CTGGCAATCG TCCGGGACGG AGCCGCGGTA TACATCGACC TTTCCGGCCA GGATGCCGAG GCCGAAAACG TCCTGGCGGC CTGGCTGCGC GACCCGGAAG CGCCCAAGGT CATGCACGGC TTCAAGGCCG CCCTCAAGGC CCTGAGCGCC CGCGGACTGG AACTGGAAGG CGTCGTCGAC GATACGTCGA TTTCCGGTTA CCTCATCCAG CCCGACCGCC GCACTTACGA GCTCGCGGAG CTGGCGCAGC ACCACCTCAA CATCGAAATC TCCACCGCCG TGGCGAAGGC CGGGCAGCTG GAATTGTCGT TCGACGGCGA TGACTCCGCC GCCGCCGGTG AACTCGTCCA CGCTGCCGCC GTCGTCCACG CCCTCAGCCG CTACTTCGAG GCGGAACTGA AGGAGCGCAA GGCCGAGGAG CTGCTGTCCA CGCTGGAACT TCCGGTGAGC CAGGTACTGG CGGATATGGA ACTCGCCGGA ATCGCCATCG ACATGCAGCG GATGGATGAG CAGCTGGCCG ACCTTGCCAA GGTGATCGAC AACGCCCAGG AACTTGCCTT CGCCGCCATC GGACACGAGG TCAACCTCGG ATCACCCAAA CAGCTGCAGA CCGTGCTGTT CGACGAACTC CAGCTGCCCA AGACCAAGAA GATCAAGTCC GGTTACACCA CCGACGCCGC ATCGCTCAAG AACCTCCTGG AAAAGACCGG GCACGAATTC CTGGTCCAGC TGATGGCGCA CCGGGAATCC TCGAAACTCC GCCAGATGCT GGAGTCGCTG AAGAAGTCCG TCGCCGAGGA CGGCCGCATC CACACCACCT ATGCGCAAAA TGTCGCAGCC ACCGGCCGTA TCTCGTCCAA CAACCCCAAC CTGCAGAACA TCCCCATCCG GAGCGAAGAG GGCCGGCGCG TCCGTGGCAT CTTCGTGGTC AGCGAGGGCT ATGAATGCCT CCTTTCCGCG GACTATTCGC AGATCGAGAT GCGGATCATG GCCCACCTCT CGGGGGACGC CGGCCTGATC CAGGCCTACC GGGACGGCGA AGACCTTCAC CGGTTTGTGG GATCGAACAT CTTCCACGTG CCCACCGACC AGGTCACAAG TGCCATGCGG TCCAAGGTCA AGGCGATGTC CTACGGCCTG GCCTACGGCC TGACCTCGTT CGGACTGTCC AAGCAGCTGG AAATTTCTGT TGACGAGGCC CGGACATTGA TGAAGGAATA CTTCGACCGC TTCGGAGCCG TGCGCGACTA CCTCCGCGGC GTGGTGGACC AGGCCCGGAT CGACGGCTTC ACGGCCACCA TCGAGGGGCG CCGCCGTTAC CTGCCGGACC TCACCAGCAC GGACCGCCAG CTGCGCGAGA ACGCGGAACG CATTGCGCTC AACTCACCCA TCCAGGGTTC CGCGGCGGAC ATCATCAAAC GGGCCATGCT GGGCGTGCAT GCTGAATTGA AGGCCCAGGG CCTCAAATCA CGGATGCTCC TGCAGGTCCA TGACGAACTG GTGCTTGAAG TTGCCGCCGG TGAACGGGAA GCGGTGGAAA AGCTGGTGAC GGAGCAGATG GGCTCCGCCG CGGACCTCAG CGTGCCGCTG GACGTCCAGA TCGGCGTCGG GCCCAGCTGG TACGACGCCG GTCACTAA
|
Protein sequence | MGTRLAPVSE TTKPALAPLS ADTAADILPA AEAESATAAK PARKTTRTAE PVSATEAPVI PITDQPRLLV LDGHSMAFRA FFALPADKFS TAAGQHTNAI HGFTSMLINL IKEQKPTHVA VAFDVSDETT HRKAEYSEYK GGRNETPREM SGQIDLIDKV MGAWGIKTIK MPGYEADDVL ATLAAMGEKA GYEVLLVTGD RDAFQLITDN VFVLYPRKGV SDIPRLDAPA IQEKYFVSPA QYSDLAALVG ESADNLPGVP GVGPKTAAKW INLYGGLEGV LEHLDKIGGK VGDSLRENVD AVKRNRRLNR LHTDLELPVT LDDLADPRPD QAALEQLFDE LEFKTIRTRL FALYGAEEVD LAERESLDIP DYSTPAGAAE LSAFLAAGAG QRSAVAVDLV PGRIGEDAAA LAIVRDGAAV YIDLSGQDAE AENVLAAWLR DPEAPKVMHG FKAALKALSA RGLELEGVVD DTSISGYLIQ PDRRTYELAE LAQHHLNIEI STAVAKAGQL ELSFDGDDSA AAGELVHAAA VVHALSRYFE AELKERKAEE LLSTLELPVS QVLADMELAG IAIDMQRMDE QLADLAKVID NAQELAFAAI GHEVNLGSPK QLQTVLFDEL QLPKTKKIKS GYTTDAASLK NLLEKTGHEF LVQLMAHRES SKLRQMLESL KKSVAEDGRI HTTYAQNVAA TGRISSNNPN LQNIPIRSEE GRRVRGIFVV SEGYECLLSA DYSQIEMRIM AHLSGDAGLI QAYRDGEDLH RFVGSNIFHV PTDQVTSAMR SKVKAMSYGL AYGLTSFGLS KQLEISVDEA RTLMKEYFDR FGAVRDYLRG VVDQARIDGF TATIEGRRRY LPDLTSTDRQ LRENAERIAL NSPIQGSAAD IIKRAMLGVH AELKAQGLKS RMLLQVHDEL VLEVAAGERE AVEKLVTEQM GSAADLSVPL DVQIGVGPSW YDAGH
|
| |