Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_1802 |
Symbol | |
ID | 7293262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 2035095 |
End bp | 2037779 |
Gene Length | 2685 bp |
Protein Length | 894 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643590207 |
Product | DNA polymerase I |
Protein accession | YP_002487867 |
Protein GI | 220912558 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000000000000838871 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACCGGCC AGCCCCGGCT CCTGGTCCTC GACGGACATT CCATGGCATT CCGCGCGTTC TTTGCCCTGC CTGCAGACAA GTTCTCCACG TCCAACGGCC AGCACACCAA CGCCATCCAC GGCTTCACGT CCATGCTGAT CAACCTCATC AAGGAGCAGC AGCCCACCCA CATTGCCGTG GCCTTCGACG TCTCCGACGA ATCGACCCAC CGCAAGGCCG AGTACAGCGA ATACAAGGGC GGCAGGAACG AAACGCCGCG GGAGATGAGC GGCCAGATCG ACCTCATCGG CCAGGTGATG GAAGCCTGGG GCATCAAGAC CATCAAGATG CCCGGCTATG AGGCCGACGA CATCCTGGCC ACCCTCGCGG CGATGGGCGA GAAGGCCGGC CACGAGGTGC TGCTGGTATC CGGTGACCGC GATGCGTTCC AGCTCATCAC CGACAACGTT TTCGTGCTTT ACCCGAAGAA GGGCGTGAGC GATATCCCGC GGATGGACGC CGCGGCCATC GAGGCGAAGT ACTTCGTCCC GCCGTCCCGC TACTCGGACC TCGCCGCACT GGTGGGCGAG ACCGCGGACA ACCTCCCCGG TGTGCCCGGC GTCGGTCCCA AGACGGCGGC CAAGTGGATC AACCTCTACG GCGGGCTCGA AGGAGTCCTG GAACACCTCG ACTCCATCGG CGGGAAGGTG GGGGACTCGC TTCGCGAAAA CGTGGACAAC GTCAAGCGGA ACCGGCGCCT CAACCAGCTG CACACTGACC TGGAACTGCC GGTCACCCTG GATGACCTGT ACGAGCCCCG GCCTGACCAG GCCGCTCTCG AAGACCTCTT CGATGAACTC GAATTCAAGA CCATCCGCAG CCGGCTCTTC GCGCTCTACG GCAACGACGA CGCCCCCGCC GCCGAGCGCG AAAGCATCGA AGCGCCGGCC TACACCGTTC CAACCACCGC GGACGAACTG GCCTCGTTCC TGGCCGGTGG AGCCGGGCAG CGCTCAGCCC TCGCCGTCGA CCTCGTCCCG GGACGCATTG GCGAAGACGC CGCCGGCCTG GCCATCCTGC GTGCGGATGC CGCCGTGTAC CTGGACCTCA CCGCGCTGGA CGCAGACACC GAAAATGCGC TGGCCACCTG GCTCCGCGAC CCCGAATCGC CCAAAGTCCT GCACGGGTTC AAGGCCGCGC TGAAGGCACT CACCGCCCGC GGGCTGGAGC TCGAAGGCGT GGTGGACGAC ACCTCCATCT CGGGCTACCT CATCCAGCCG GACCGGCGCA CCTACGAGCT CGCCGAACTG GCCCAGCACC ACCTCAACGT CGGGATGCCG GCAGCCACCG CCAAGGCAGG CCAGCTGGAG CTGTCCTTCG ACGGCAACGA CACCGCCGCG GCCGAGGCAC TTGTCCAAGC CGCCGCCGTC GTCTCCGCCC TGAGCCGGTT CTTCGAAACC GAGCTGAAGG AACGCCGGGC CGAGGAGCTC CTGTCCACGC TGGAACTGCC GGTGAGCCGC GTCCTCGCGG ACATGGAGCT GGCCGGGATC GGGATCGACC TGGACCGCAT GGATGAACAG CTCGCCGACC TCGCCAGGGT GATCGACCAG GCGCAGGAGC AGGCCTTCGC GGCGATCGGG CACGAGGTCA ACCTTGGTTC CCCGAAGCAG CTGCAGACGG TCCTGTTCGA GGAACTGCAG CTGCCCAAAA CCAAGAAGAT CAAATCCGGG TACACCACCG ATGCCGCGTC GCTGAAGAAC CTCCTGGAAA AAACAGGCCA CGAGTTCCTG GTCCAGCTCA TGGCCCACCG CGAAGCCGCC AAGCTGCGCC AGATGATCGA GTCCCTCAAG AAGTCGGTAG CCGAAGACGG CCGGATCCAC ACCACCTATG CCCAGAACGT GGCAGCCACC GGCCGGATCT CGTCCAACAA CCCCAACCTG CAGAACATCC CCATCCGCAG CGAGGAAGGC CGGCGCGTCC GCGGCATCTT CGTGGTCAGC GACGGCTACG AGTGCCTGCT GTCCGCGGAC TACTCGCAGA TCGAAATGCG GATCATGGCC CACCTGTCCG GCGACGCCGG CCTGATCCAG GCGTACAAGG ACGGCGAGGA CCTGCACCGG TTCGTCGGGT CCAACATCTT CCACGTGCCC ACGGACCAGG TCACCAGCGC CATGCGGTCC AAGGTCAAGG CCATGTCCTA TGGCCTGGCG TACGGACTGA CATCCTTTGG GCTGTCCAAG CAGCTCGAAA TTTCCGTGGA CGAGGCCCGC ACCCTCATGA AGGACTACTT CGACCGGTTC GGCGCCGTCC GCGACTACCT TCGCGGCGTG GTGGACCAGG CACGCGTGGA CGGCTACACG TCCACCATCG AGGGCCGCCG CCGCTACCTG CCGGACCTCA CCAGCACGGA CCGCCAGCTC CGTGAGAACG CCGAACGCAT TGCGCTCAAC AGCCCCATCC AGGGATCTGC GGCGGACATC ATCAAGCGGG CCATGCTCGG AGTGCACGCC GAACTGCAGG CACAGGGACT CAAATCGCGG ATGCTGCTGC AGGTCCACGA CGAACTCGTC CTCGAGGTGG CGCCGGGCGA ACGTGCCGCC GTGGAAGCGC TGGTCACCGA GCAGATGGCC GCCGCCGCGG ACCTCAGCGT CCCGCTGGAA GTCCAGATCG GCGTCGGGCC CTCCTGGTAC GACGCCGGCC ACTAG
|
Protein sequence | MTGQPRLLVL DGHSMAFRAF FALPADKFST SNGQHTNAIH GFTSMLINLI KEQQPTHIAV AFDVSDESTH RKAEYSEYKG GRNETPREMS GQIDLIGQVM EAWGIKTIKM PGYEADDILA TLAAMGEKAG HEVLLVSGDR DAFQLITDNV FVLYPKKGVS DIPRMDAAAI EAKYFVPPSR YSDLAALVGE TADNLPGVPG VGPKTAAKWI NLYGGLEGVL EHLDSIGGKV GDSLRENVDN VKRNRRLNQL HTDLELPVTL DDLYEPRPDQ AALEDLFDEL EFKTIRSRLF ALYGNDDAPA AERESIEAPA YTVPTTADEL ASFLAGGAGQ RSALAVDLVP GRIGEDAAGL AILRADAAVY LDLTALDADT ENALATWLRD PESPKVLHGF KAALKALTAR GLELEGVVDD TSISGYLIQP DRRTYELAEL AQHHLNVGMP AATAKAGQLE LSFDGNDTAA AEALVQAAAV VSALSRFFET ELKERRAEEL LSTLELPVSR VLADMELAGI GIDLDRMDEQ LADLARVIDQ AQEQAFAAIG HEVNLGSPKQ LQTVLFEELQ LPKTKKIKSG YTTDAASLKN LLEKTGHEFL VQLMAHREAA KLRQMIESLK KSVAEDGRIH TTYAQNVAAT GRISSNNPNL QNIPIRSEEG RRVRGIFVVS DGYECLLSAD YSQIEMRIMA HLSGDAGLIQ AYKDGEDLHR FVGSNIFHVP TDQVTSAMRS KVKAMSYGLA YGLTSFGLSK QLEISVDEAR TLMKDYFDRF GAVRDYLRGV VDQARVDGYT STIEGRRRYL PDLTSTDRQL RENAERIALN SPIQGSAADI IKRAMLGVHA ELQAQGLKSR MLLQVHDELV LEVAPGERAA VEALVTEQMA AAADLSVPLE VQIGVGPSWY DAGH
|
| |