Gene Information |
Locus tag | Arth_3970 |
Symbol | |
ID | 4447630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4484504 |
End bp | 4485724 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639691801 |
Product | DNA polymerase III, epsilon subunit |
Protein accession | YP_833445 |
Protein GI | 116672512 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family |
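
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length (the function name `feature_lengths` is hypothetical, not part of the database):

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record for Arth_3970.
gene_len, protein_len = feature_lengths(4484504, 4485724)
print(gene_len, protein_len)  # 1221 406
```

Under those assumptions the result agrees with the Gene Length (1221 bp) and Protein Length (406 aa) fields.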
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.126437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCAG GATTCGCAGT CGTCGACGTG GAGACGACCG GGCTCTTTCC TGGCAATCAC CGCATTGCCG AGATCGCCGT CGTGCACGTC GATCCAGATG GCACCGTCAG CAACCGCTGG GAGACGCTGG TGAATCCCCA GCGCGACTTA GGCCCGCAGC ACATCCATGG CATTCGGGCT GCCGACATCC TGCACGCTCC CGTTTTCGGG GACATTGCCC ATGACGTCAT GGAGTTGTTG GCAGGCCGCG TGCTGGTCGC GCACAACGTT CATTTCGACT ACCGATTCGT CTCGCACGAG CTAGGGCGCG CCGGCCTCGA GCTGCCGTTC GAAGTCCATG ATTGTTTGTG CACCATGAAA CTGGCACGCC GCTACCTGCC CGGAGCAGGG CGCTCGCTTA AGGACTGTTG CGACAGCTTT GGGCTCAACC TGGCGAACGC CCACAGCGCT GGTGACGATG CAGAGGCCGC CGCGCATGTG CTGAGCAACT ATCTCCGCAT GGATTCTGGT AACCCCGAGT GGTTCAGCGC TTTGGACCGG GCCTACTTTT CTTCGTGGCC TGCCGTGGAG CCGAAGCGCC GTCAGCCTGC CCTTCGCCGC CCGGCGGGTG AGCCGCAGAC TCACTTCCTT GAGAGGCTGG TGAACCGGCT ACCTGAGGTT GCAGGGCCGG ATGAGCACAA CGCCTATCTG GCAATGCTTG ACCGCGCCCT TATGGATCGA CAGATTTCAG TATCCGAGGC TGACGGCCTT GTGGCCCTCG CCGAATCCCT GGACATCAGC AGAAGTACAG CTGAACAACT TCACATCAAG TACATGATTT CCATGCTCGC CGCAGCTTGG GACGACGGTG TGGTTACAAC TGAAGAGGAA GCTGACCTCC GAGTTGTTGG GAACCTGCTC GGCATCAGCC AAGAATCAAT AACACGCGGG CTCGTGGCTC CAGTTGCGGG ACAGGATGAG GAAACTGGTG CGGCAGTCCA GTCGCTGGTG CTCGGGGGCG GCGACAAGAT CGTGCTCACC GGTGAGATGT CGCGAGATCG AAACGACATT GAGGCGGACC TCCGCGCGGC AGGGTTTGTC CCGCATCCGG CGATCACGAA AGCGGTGAGA CTGCTGGTCG CGGCGGATCC CGACAGTTTG TCAGGCAAAG CCAAGAAGGC GCGGAGCTAC GGCATACCGG TAGTCGGCGA GCCCTATCTG ACTACCTTGC TGCGCGGTTA G
|
Protein sequence | MNAGFAVVDV ETTGLFPGNH RIAEIAVVHV DPDGTVSNRW ETLVNPQRDL GPQHIHGIRA ADILHAPVFG DIAHDVMELL AGRVLVAHNV HFDYRFVSHE LGRAGLELPF EVHDCLCTMK LARRYLPGAG RSLKDCCDSF GLNLANAHSA GDDAEAAAHV LSNYLRMDSG NPEWFSALDR AYFSSWPAVE PKRRQPALRR PAGEPQTHFL ERLVNRLPEV AGPDEHNAYL AMLDRALMDR QISVSEADGL VALAESLDIS RSTAEQLHIK YMISMLAAAW DDGVVTTEEE ADLRVVGNLL GISQESITRG LVAPVAGQDE ETGAAVQSLV LGGGDKIVLT GEMSRDRNDI EADLRAAGFV PHPAITKAVR LLVAADPDSL SGKAKKARSY GIPVVGEPYL TTLLRG
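
The gene sequence as listed begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch of how the protein sequence can be rederived from it, assuming the amino-acid assignments of NCBI translation table 11 (the same as the standard code) and ignoring table 11's alternative start codons, which do not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical:

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces used for 10-letter grouping and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last blocks of the gene sequence from the record.
print(translate("ATGAACGCAG GATTCGCAGT CGTCGACGTG"))  # MNAGFAVVDV
print(translate("ACTACCTTGC TGCGCGGTTA G"))           # TTLLRG
```

The two printed fragments match the start and end of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.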