Gene Information |
Locus tag | Arth_2021 |
Symbol | |
ID | 4445465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2278698 |
End bp | 2279771 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639689829 |
Product | DNA polymerase IV |
Protein accession | YP_831501 |
Protein GI | 116670568 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair |
TIGRFAM ID | |
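
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 2278698
END_BP = 2279771


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1074, matching "Gene Length"
print(protein_length(length_bp))  # 357, matching "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length (1074 bp) and protein length (357 aa).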
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.482303 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGGAA TCCGGTGGGT GCTGCACGTC GATCTCGACC AGTTCATCGC GGCGGTCGAA GTGCTCCGGC GGCCGGAGCT TGCGGGCAAG CCGATCATTG TCGGCGGTCG GGGCGATCCC GCGGAACGAG CTGTGGTGGC GACCGCGTCC TACGAAGCCA GGGCGTTCGG CGTGGGTTCC GGAATGCCGT TACGCATTGC GGCCCGGAAA GTGCCCGACG CTGTGATCCT GCCCGTCGAC CAGGAGGCTT ACCTCGCGGC GTCCGAAACG GTGATGGCTA CCCTGCGCTC GCAGCCGGGC GCCACCGTGC AGGTGCTGGG CTGGGATGAA GCCTTTGTAG GCACTGAGAC AGAGAACCCG GAAGCCTACG CCCGGCAGGT GCAGGCCGCT GTCCTGGAGC GAACGCAGCT GCATTGCAGC ATAGGCATCG GCGACACTTT GGTCCGGGCC AAGGTCGCCA CCGGTTTCGG CAAGCCGGCC GGCGTCTTCC GCCTCACTTC AGCTAACTGG CTCAAGGTCA TGGGCGACCT GCCCACCAAA GACCTGTGGG GCGTTGGAAC CAAAGTGTCT GCCCGGCTGG CCAAACTCGG CATCCACACA GTCGCCGAGC TCGCCGCCAC CGACCCCCGG GACCTCGTTC CGGAGTTCGG CCCCAGGATG GGTCCCTGGT ACGCGGAGCT CGGACGCGGG GACGGCGCCA GCGTTGTGGA CGACACCCCG TGGGTTGCCC GCGGGCATAG CCGGGAGACC ACCTTCCAAC AGGACCTGAC TGCGCCCGCC CAGGTGGACG ACGCAGTCAG GGAGCTGACA GCCCGTGTTC TTGAGGATGT TGAGGCCGAA GGGCGGCCCG TGGTCGGGCT GACCCTCAAG GTTCGGTATG CGCCGTTCTT CACCAAGACC CACGCGAAGA AGATTCCCGA AACATTCGAT AGGGACGAAA TCCTCGCGCG GGCATTGGAC CTCGCAGCCG GAATTGAAGC GGGCCGCCCG ATCCGGCTCC TGGGCATGCG GGCCGAAATG GCAATGCCCG AGGATGCCCG AAAGGGCCAT ACGCCCACGC GCGGCGGTTG GTGA
|
Protein sequence | MSGIRWVLHV DLDQFIAAVE VLRRPELAGK PIIVGGRGDP AERAVVATAS YEARAFGVGS GMPLRIAARK VPDAVILPVD QEAYLAASET VMATLRSQPG ATVQVLGWDE AFVGTETENP EAYARQVQAA VLERTQLHCS IGIGDTLVRA KVATGFGKPA GVFRLTSANW LKVMGDLPTK DLWGVGTKVS ARLAKLGIHT VAELAATDPR DLVPEFGPRM GPWYAELGRG DGASVVDDTP WVARGHSRET TFQQDLTAPA QVDDAVRELT ARVLEDVEAE GRPVVGLTLK VRYAPFFTKT HAKKIPETFD RDEILARALD LAAGIEAGRP IRLLGMRAEM AMPEDARKGH TPTRGGW
|
| |
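
The gene sequence begins with GTG while the protein sequence begins with M, which is what one expects when a bacterial CDS uses an alternative start codon under translation table 11. The following is a minimal sketch of how the listed gene sequence could be translated and its GC fraction computed. The helper names are hypothetical; the start-codon set is restricted to ATG, GTG and TTG as a simplifying assumption rather than the full set defined for table 11; and the amino-acid assignments are those of the standard code, which table 11 shares.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initiating start codon as M
    and stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence above.
print(translate_cds("GTGAGCGGAA TCCGGTGGGT G"))  # MSGIRWV
```

Applied to the opening bases of the gene sequence, the sketch reproduces the first seven residues of the listed protein sequence (MSGIRWV). Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.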