Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_2871 |
Symbol | |
ID | 7294351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 3202691 |
End bp | 3207829 |
Gene Length | 5139 bp |
Protein Length | 1712 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643591285 |
Product | Animal heme peroxidase |
Protein accession | YP_002488925 |
Protein GI | 220913616 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAAAG CACCTCGCGC AACAGCGTGC AGACGAAATG GCACCACCAC CGGAGCAGGG CTGCGTGCCC TTGCAGCATC AGGGGCACTG GCGCTGGTGG CATCCATGGG CCTGCCTCCG TTGGCAGCCC AGGCCGTACA GGCCCCGGTC GGCTCCGGAT TCACCGTCAC CCCGGCCGAT CTGTCCTACA TCCTCAAGCA GATCAAGATC GCCGAAGCGC ACGTGGCAAA TACCACGTCC GCCACCGGCC CGTGCGGGGC GCTGATCGGC ACCGGACCCA ACCAGCTTGC CAGCCCGCTG TTGTCCCACG GCCTGCGGAC CGTGGACGGC AGCTGCAACA ACCTGCAGCC GGGCCAGGAC ACTTACGGCG CCTCGGACCA GGTGTTCCCG CGGCTGGCAC CCAAGGCCTT CGGACCCGCC GAGAGCGGCT CGTTCGGCGG ACCGCCGGTG GCCACAAGCT ACACGCAGAA ATCGGGAAGC GTCTTCGATT CCCGGCCGCG GACCATCAGC AACCTGATCG CGGACCAGAC CTCCACCAAC CCGGCCGCGG TGGCCGCAGC CGGGTTCCCC GCCCGGTCCC AGGGCAACAC CGGCGTGGTG CCGTGCACCA CTGATCCGGA CGCCGAGGCC GTCCCCCCGG TCGCAGCAGC GCCCGAAGGC TGCGTCCCCT CGCACAACAC ACTGGACATC CCCAATGTCA CTACTGACGT GGGCCTGTCA CCGCCCTACA ACTCGCTGTT CACCCTGTTC GGCCAGTTCT TCGACCACGG CATCGACCAG ACCGTCAAGG GCGGTGGAAC CGTCTACGTG CCGCTCAAGG CCGACGATCC GCTGATTGCG GGGCCCGACC ACGATTTCGG CACCGCGGAC GACCTCAATC CCCACCTGCG CTTCATGGTC CTCACCCGCG GGCAGAACCA GCCGGGCCAG GACGGCATCC TGGGCACCGC CGACGATCTC CAGGACGCGC TGAACACCAA CTCTCCATGG GTGGACCAGA GCCAGACGTA TGCGTCCCAC TCCTCGCACC AGGTCTTCCT GCGCGAATAC ACCAACAACC CCGAGGGCCG TCCGGTCTCC ACCGGCGGCC TGCTGGGCGG CCCGGCGGGT ACGGCCGCGG CCGGCGGCAT GGCTACCTGG GCCGACACCA AGAAGCAGGC CCGCGAAATG CTCGGCATCC AGCTCCTGGA CAAGGACGCA CTCAACGTTC CCCTGCTGGC GGCCGACGCC TACGGCAAAT TCATCCCCGG CCCCAAAGAC GGGCTGCCGC AGTTCGTCAC CCGGTCCGGG CTGGTGGAGG CGGACCGGAC CGCGAACGGC GGCAACGGAA CACTGGTGCC CCAGGACATC CTGTACTTCA ACACACCATT CCTGACGGAC ATCGCCCACA ACGCAGATCC TTCGCCCCAG GACACCGACC ACAACCCGGC AACACCCCCG GCGGCACCGG CCCCGGACGC GGACAACACG GCGTCCGCGG ACTTCGCGGC CCAGGCTCCC GGGACGTACG ACGACGAGAT GCTGGGTGCG CACTTTATCG CCGGCGACGG ACGGGTCAAC GAGAACATCG GGCTCACGGC GATACACCAG GTGTTCCACT CCGAGCACGA CCGCCTGGTG GGTGACATTA AGAACGTTCT CACATCGGAC AAGTCGTCCC GCGGTACTGC TGCCCTCACC GAATGGCGGG CGACGGCAGG CGCTGACGGC TGGAACGGCG AGCGCCTCTT CCAAGCGGCA CGCTTCATCG CGGAAATGGA GTACCAGCAC CTTGTCTTCG AGGAGTTTGC CCGCAAGATC CAGCCGGCGG TGAACATCTT CGAACCCTTC GCGTTCTCCC AAACCGACGT CAACCCGGCC ATCAACGCGG AGTTTGCCCA CTCGGTCTAC CGTTTTGGGC ACTCGATGCT CACAGAAACC ATCTCGCGGC GCAACGAGGA CAGCCCGGGG CCCGACGGTG TGTGGGGCAC GCAGGACGAC GTGCCGGGTT CGCAGAACGA CCTGCCGCTC CTGGGAGGAT TCCTGAACCC GCCCGCCTAC ACCGACGGCG GCCCCGCAGG TCCGCTGACC TCCGAGGAGG CCGCCGGCAG CATCGTCATG GGAATGTCGG ACCAGGTGGG CGCGGAACTT GACGAGTTCG TCACGGACAC CCTGCGCAGC AAGCTGCTGG GCCTGCCGAT GGACCTGGCC GCCATCAACC TCGCCCGCGG CCGGTCCGAA GGAATTCCGG CGCTGAACGT CTTCCGCCGG CAACTCCACG GCGCCACCAA CGACAGCCAG CTCAAGCCGT ACGCCAACTG GATCGACTTC GGCGAAAACA TCAAGCACCC CGCTTCGCTG ATCAACTTCA TCGCCGCCTA CGGCACCCAC CCCTCCATCG TTTCCGCCAC CACCCTGGAC GCCAAACGGA AAGCCGCCCG CCTGATAGTC AGCCCGGACG CCCTGGCCGG CGAGGTGGCC CCGGATGATG CGGTGGCATT CATGAACAGC ACCGATGCCT GGGCCAACAA CGGCACAGCA TCAACTACCG GGCTGGACGA CATCGACCTG TGGATGGGCG GACTCGCCGA ACGGACCAAC ATGTTCGGTG GCCTGCTGGG CAGCACCTTC AACTACGTCT TCGAGAGCCA GATGACGGAC CTGCAGAACG GTGACCGGCT GTACTACCTG GCCCGCACAC CCGGGATGAA CCTGATGGCG CAGCTTGAAG GGAACTCCTT CGCGGAGCTG ATCATGCGCA ACACGAACGC AAAGGCCCTG AAGGCTGACG CGTTCGCCAC CGCCGACTGC AAGTTTGAGC TGAAGAACCT GGCGGGAACG TCCGAAGGAT TCGCTGCCTC CGGAAATACA GTGGCGGACG ATGCCGGAAC CGAGTGCAGC GAGACCGCAC TGCTGCTCCG GATGCCCGAC GGCACCATCA AGTACCGGGC CTCCAACTCC GTGGACCCAG TGGGGATCAA CGGCCAGGCC GTGTACAACG GCACGGACCG CGCCGACCGC GTCCACGGCG GCGTGGACAA CGACACCTTC TGGGGCGGAA AAGGCAACGA CGTCATTGAA GGCGGCGACG GCGCAGATAC GGTCCTCGGC GGCGAGGACA ACGACGTGGT CACCGATCTT GCCGGCGACG ACATCCTCAA GGGCGGCCCG GGCAATGACG CGCTCGACGG CGGCCCGGGC CTGGATCTGA TGCTCGGCGG AGACGGCAAG GACTTCATCA ACGGCGGGGC TAACTCGAAC GAAACTTTCG CCGGCGAAGG CGACGACTTC GTGATCGCCG GCGAAGGCCT GGACGGCGTC TTCGGCGGCG GCGGGGACGA CTGGGCCGAG GGCGGCGACA GCCCCGACCT GCTCATCGGC GACTCCAGCA ACCTGTTCTT CCTCGACGAT TCGCAGAAAC CCGGCCACGA CATCCTCATT GGCCAGGGCG GCGACGACGA CTACGACATG GAAGGCGGCG ACGACATCGG CCTGGCCGGG CCGGGCATCG AGAAGGTGGC CGGGGCGTCG GGCTTCGACT GGGAATCCGG TGCCTACGAT CCCCAGCCGC AGGACGCCGA CCTGAACCTG CCGATCGCAC CTTTGGACAT CCTGCAGGTG GGCGTCCGGG ACCGGTACAA CGAGGTGGAA GCACTCTCCG GCGGTCCGTT CGCTGACACC CTCCGCGGCG ACGACCTCAC CCCCCGGACG GTGGGCGGCG GCGGCTTTAT CGGCTGTGAC GTCCTGGACC AGGCCGGCAT CGACCGCATC CTTGGGCTCG ATCAGCTGGT TCCATCCCTG CCAACACCGG TGGAGGACGT CATCAACGCT TCGGCGTCGA AGGAGTGCCC TGTCCTTACC GGATCCCAGG TGTGGGGCGA GGGCAACATC CTCCTCGGCG GTGGCGGCGA CGACACCATC GAAGGCCGCG GTGGGAACGA CATCATCGAC GGCGACCGTT ATCTGAGCGT CCGTCTCAGC GTCCGCACGG ATCCCGCAGA TCCGGCCACC GAAGTAGGCA GCGCCACGTC CATGACGGCA CAGTTCCAGC GCGCCGCGGA CGGCACACTG ACCGGACCCA CACTCCAGCA GGCAGTCTTC GCCGGCAGCA TCGACCCGGC GGACGTCGTT GCGGTGCGGG AGATCCTCAG CTCGGCCGGA GGCACCGACG CCGCACTGTT CTCGGACCTT GAGTTCAACT ACACCATCAC CACCACCGGC GGCGACGGCA CGCTTGGCTC CCCCGGCTCC GTCACCACCG TGCAGCACAA CGGCGGCCGG GACGGCACGG ACACCCTGCG CAACGTCGAA CTGCTCGTCT TTGGCGACGG AACCGGTACC GGAAACGGCG GCAACGGAGC CGACGACGGC GTCACCGAAC CACCGGAGGA CGAGCCCGTC GAAGGCGATG AGGGCGAAGG CGGTGGCATC GGCGGGGTCA TCCCGGGCGA CGGCGAAGTG GTGGTCATCA TCGATCCCGT TCCAGACCCA GGGACGGATC CCGGAACTCC GCCGGTTGAC CCCGCACCGG TTGAGCCCGC ACCAGTGGAC CCCCCAGCGA ACCCGGCACC GGCAGATCCG CCGGCCAACC CGGGGCCCGT CAACCCCGCA CCGGTTGACC CGCCAGCGAA TCCGGCACCG GCTGACCCCG CACCCGCACC CGCTTCCGGC GGCGTCGCCG TCAGGACAGT TGATGCCACG GGCACGCAGG TGGGCGAGAT CATCGTCGCT GAGCCCGGCA CCTCCCGGGT GGTGGTCCGC GGCCTGGTGA ACGGGGAAGC CTACCGGTTC CAGGCGGCCC CCACTACTGG CGGCAGCGCG CCGTCGTTCT CGGCACCGAG CAAACCTGTT GTCCCCGGCC CGGCAGCCGG ACGCCGCAGC CGAGTGGGCG GCAGCGCCGT GGTGCCGGTC CCCGCGGAGC CCGTGACGGA GCAGCAGCCA GCGGTCACCG GGACCCTCCC GCTCGGTGGC CTGGCCGCTT TCGCGCCTGT CCTGCTGACG GCCCTGGGGC ACTATCTGGC CACCGGCGAA GGGGGCACCA CGCTGGGCCT GGTCATCGGA GGATTGCTGG CAGCCGGCGC CTTGCTGGCT TTCCGGTTCC ACTCCGCCCG TTCCAAGGGC AAGGCAGCCC GCGGCGCTAA GGCCGTCGAC AGGGCGTGA
|
Protein sequence | MGKAPRATAC RRNGTTTGAG LRALAASGAL ALVASMGLPP LAAQAVQAPV GSGFTVTPAD LSYILKQIKI AEAHVANTTS ATGPCGALIG TGPNQLASPL LSHGLRTVDG SCNNLQPGQD TYGASDQVFP RLAPKAFGPA ESGSFGGPPV ATSYTQKSGS VFDSRPRTIS NLIADQTSTN PAAVAAAGFP ARSQGNTGVV PCTTDPDAEA VPPVAAAPEG CVPSHNTLDI PNVTTDVGLS PPYNSLFTLF GQFFDHGIDQ TVKGGGTVYV PLKADDPLIA GPDHDFGTAD DLNPHLRFMV LTRGQNQPGQ DGILGTADDL QDALNTNSPW VDQSQTYASH SSHQVFLREY TNNPEGRPVS TGGLLGGPAG TAAAGGMATW ADTKKQAREM LGIQLLDKDA LNVPLLAADA YGKFIPGPKD GLPQFVTRSG LVEADRTANG GNGTLVPQDI LYFNTPFLTD IAHNADPSPQ DTDHNPATPP AAPAPDADNT ASADFAAQAP GTYDDEMLGA HFIAGDGRVN ENIGLTAIHQ VFHSEHDRLV GDIKNVLTSD KSSRGTAALT EWRATAGADG WNGERLFQAA RFIAEMEYQH LVFEEFARKI QPAVNIFEPF AFSQTDVNPA INAEFAHSVY RFGHSMLTET ISRRNEDSPG PDGVWGTQDD VPGSQNDLPL LGGFLNPPAY TDGGPAGPLT SEEAAGSIVM GMSDQVGAEL DEFVTDTLRS KLLGLPMDLA AINLARGRSE GIPALNVFRR QLHGATNDSQ LKPYANWIDF GENIKHPASL INFIAAYGTH PSIVSATTLD AKRKAARLIV SPDALAGEVA PDDAVAFMNS TDAWANNGTA STTGLDDIDL WMGGLAERTN MFGGLLGSTF NYVFESQMTD LQNGDRLYYL ARTPGMNLMA QLEGNSFAEL IMRNTNAKAL KADAFATADC KFELKNLAGT SEGFAASGNT VADDAGTECS ETALLLRMPD GTIKYRASNS VDPVGINGQA VYNGTDRADR VHGGVDNDTF WGGKGNDVIE GGDGADTVLG GEDNDVVTDL AGDDILKGGP GNDALDGGPG LDLMLGGDGK DFINGGANSN ETFAGEGDDF VIAGEGLDGV FGGGGDDWAE GGDSPDLLIG DSSNLFFLDD SQKPGHDILI GQGGDDDYDM EGGDDIGLAG PGIEKVAGAS GFDWESGAYD PQPQDADLNL PIAPLDILQV GVRDRYNEVE ALSGGPFADT LRGDDLTPRT VGGGGFIGCD VLDQAGIDRI LGLDQLVPSL PTPVEDVINA SASKECPVLT GSQVWGEGNI LLGGGGDDTI EGRGGNDIID GDRYLSVRLS VRTDPADPAT EVGSATSMTA QFQRAADGTL TGPTLQQAVF AGSIDPADVV AVREILSSAG GTDAALFSDL EFNYTITTTG GDGTLGSPGS VTTVQHNGGR DGTDTLRNVE LLVFGDGTGT GNGGNGADDG VTEPPEDEPV EGDEGEGGGI GGVIPGDGEV VVIIDPVPDP GTDPGTPPVD PAPVEPAPVD PPANPAPADP PANPGPVNPA PVDPPANPAP ADPAPAPASG GVAVRTVDAT GTQVGEIIVA EPGTSRVVVR GLVNGEAYRF QAAPTTGGSA PSFSAPSKPV VPGPAAGRRS RVGGSAVVPV PAEPVTEQQP AVTGTLPLGG LAAFAPVLLT ALGHYLATGE GGTTLGLVIG GLLAAGALLA FRFHSARSKG KAARGAKAVD RA
|
| |