Gene Information |
Locus tag | Arth_1243 |
Symbol | |
ID | 4446272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1366424 |
End bp | 1371301 |
Gene Length | 4878 bp |
Protein Length | 1625 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639689051 |
Product | heme peroxidase |
Protein accession | YP_830737 |
Protein GI | 116669804 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.941188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAAC ATCGCAAGTC CCGAAAGGGG GACGGCACGC CTCATCCGCA GGGCCCGGCG GCCGGCGTGA CCCGCAAGGT CACCGCCGGT GCGCTGGCCC TTCTGATCGG GGCGGGGGTT GTCCCCTCAG TTCTGGCACC AGCCGCTTCC GCGGCTCCGA CGGGCCAGGG ATTCACCCTC AATGCCGGCG ATATGCGGTT CATTCTCAAG CAGATCAAAA TCGCCGAAAA CAACGCCACC AGAGAGGACG CACAGGGGAA CGACGTTCCC GGCCAGCCGC TCCTGGGCGA CGGCCCCAAC CAGGTGGCCA GCCCGTTGCT GCCTTACGGC CTGCGGACCG TTGACGGTAC CGACAACAAC CTGGTGGCCG GCCAAAGCGG CTACGGCTCC GCCAGCCGTG AGTTCCCCCG GTTGTCGGAC CCGGAATGGC GCACGTCGTC CGGCGGACAG AACTACGAAT CCGTCAACGC CAACGTCACC GATAACGGCC CGCGCTTCGT CAGCAACGTC ATTGTGGACC AGACCGCAAC CAACCCCGCA GCTGTAGCCG CGGCCGGCAA GGCGCACCGA ACGGTCAACG ACGGCCCCAC GGCCGTGCCC TGCGACGGCA ACGGACTTCC GGAAAACTGC GTGCCCGAGG GAGAGACCCT CGACATTCCC AACGTCACCA CGGACTTCGG CCTTTCGCCG CCCTACAACG GCATGTTCGC GCTCTTTGGC CAGTTCTTCG ACCACGGCGT GGATTTCACC AAGAAGACCA AGAACTACGT CATGATGCCG TTGTCTCCGG ACGACCCGTT ATATGTTCCG GGCGGGCGCA CCAACTTCAT GCTGCTGAAC AGGGCGGAGA ACCAGCCCGG TCCTGACGGC GTCCTGGGAA CAGCCGACGA CGAGCAGAAC GCCACCAACA CGGACTCTCC GTGGGTGGAC CAGAGCCAGA CGTATTCCTC GCATTCGTCC CACCAGGTGT TCCTGCGCGA ATACACCCTG AATGAAAACG GCGATCCCGT GTCCACCGGC GAGCTCATCG AGGGCGAACC CGGCGGCATG GCAACATGGG CCCGTATCAA GGAACAGGCC CGGACGATGC TGGGGCTTGA ACTGTCGGAC GTTGACGTGG CGGACATTCC CAAGCTGGCC ACCGACCAGT ACGGCCGGTT CCTGCGCGGG CCGAACGGAC TCCCGCAGTA CGAGACCGCT ACCGGGCGCG TTGAAGGAAA CCTGGCGGCT CCGGTGGCAC CGCCGGCCAA CGTGGAACGG ATCGGCATCG CCTTCCTGGA CGACATCGGC CACAACGCGG CGCCCTTCAA CTCCCAGACC GGGGCCCCGC TGCAGCCGGA TGACGACGAG GACGTCAACG GCGTCAACGA GCCGCGCCCG GCCGGCCGCT ATGACGATGA AATGCTGGAC AAGCACTTCG TTGCCGGTGA CGGCCGCGTC AACGAGAACA TCGGCCTGAC GGCGATCCAC CAGGTGTTCC ACTCCGAGCA CAACCGCCTG GTGGGCTACA TGGAGGAGCT TCTCACTTCA CAGAACCTCG ACCTCAACGA ATGGAAGCTT CCCAACGGGC AGTGGAACGG TGAACGGCTG TTCCAGGCGG CACGCTATGT GACCGAGATG GAGTACCAGC ACATCGTGTT CGAGGACTTT GCCCGCAAGA TCCAGCCCGG CATCAACGGT TTCAACGTCT TTACGCAGTC GGACACCGGC ATCGACCCCG CCATCCAGGC GGAGTTTGCC CACGCCACCT ACCGGTTCGG CCACTCGATG CTCACCGAAA CCGTTGACCG CAAGCTCAAC 
GACGGCACGG ATATCGGGAT GCCGCTGCTG GACGCGTTCC TCAACCCGCC GGCCTACTAC GAGAGCACTG CGGGAACCCT GAATCCCAAG CAGGCAGCCG GTGCCATCGC CATGGGGATG ACGGACCAGG TTGGTGCGGA ACTCGACGAG TTCGTGACCG ACACCCTGCG GAACAACGTC CTCGGCCTGC CCCTGGACCT TGCCTCGCTG AACCTTGCCC GCGGCCGGGA CACGGGCATC CCGTCCCTGA ACAACTTCCG CACCCAGCTG TACGCCAGCA CCGGCGAGTC CTCGCTGAAG CCCTACACCA GCTGGGTGGA TTTCGGCCAG AATCTGAAGC ACCCGGATTC CGTGGTCAAC TTCATGGCCG CTTACGGCAC CCATGAGACG ATCACCGCGG CAGCGTCCAT CACGGACAAG CGCGCAGCGG CCCAGCGGCT GTTCGACATG GACACCGCTG ACCCTGCCAC TCCCGCGGAC TCCTACGACT TCGTCAACAG CGCCGGAGCC TGGGCATCGC AGCCCAGCGG CCTGAACAAC GTGGACCTGT GGGTGGGCGG CCTCGCGGAG CGCCAGAACC TCTTCGGCGG ACTCCTGGGC TCCACGTTCA ACTACATCTT TGAACGCCAG ATGACGGACC TGCAGGACGG GGACCGGCTG TACTACCTGT CGCGCACCTC GGGACTGAAC CTCCGCACGC AGCTGGAAGG CAACTCGCTG GCAGAGCTGA TCATGCGCAA CACGGATGCG GAAGCCCTCA AGGCGGACGT CTTCGGCGTG GCGGACTGCG AGTTCGAGCT CGGCCGCATC ACCGCAGGCA CCGGAAACTC CGTGGCGGAC GACCCGGCCT CCGCCTGCGA CGAATCCGCC CTCCTCATGA GGATGTCGGA CGGCACGATT CGGTACCGCG TCAGCAACAC CGTGGACCGT CCGGGCCTCA ACGCCCAGTC CACCTTCAAC GGCACCGCTC TCGGTGACCG GATCTGGGGC GGCATCGACA ACGATACGTT CTGGGGCAAC GACGGCCAGG ACATCATCGA AGGCAATGAC GGTGCCGACA CTGTCCTTGG CGGTGACGGA AACGACCGGA TCACGGACTC GCACGGCGAC GACGTCCTGA AGGGCGGCAA CGGAAACGAC GCCATCGACG CCGGCCCGGG CCTGGACATC ATCATGTCCG GAGATGGCGA CGACTTCTCC AACGGCGGCC TCAACGGCAA CGAGACGTTC GCCGGCGAAG GCAACGACCT GGTCCTTGCC GGCGACGGAC CGGACACGGT ATTCGGCGGC GGGGGCGATG ACTGGCAGGA AGGCGGTAAC TCCAACGACC TGCTGCAGGG CGATAGCGGC GCACCGTTCT TCGACGACAT CAACGCTCCG GGCCATGACG TCCTGACCGG CGGTTCCGGC GAAGACGACT ACGACGCCGA AGGCGGCGAC GACGTGATGG TCGCCGGCCC CGGCATCGAA CGCAACCACG GAGTGTTCGG CTTCGACTGG GTTACGCACG CACGCTCTGT CGAACCGGCC GATTCGGACA TGCGCCAGAT CATCGTCGAC GGCCCGAACG CCCTCAAGGA CCGTTTCCTC CTGGTCGAGG CGCTCTCCGG CTGGGACAAG GACGACGTTC TCCGCGGCGA CGACGAGGTG CCGTCCGTGC CCAACACCGA GGTCAACGTC GAAGGCCTCA GCAACGAACT GGACGCAGCC GGTATTGCCC GGATCAGCGG CCTGGCCGGC CTGCTGCCTG CGGGTGCCAC CACCTTCGGA GCCGGCAACA TCATCATTGG CGGCTCCGGC AGCGACCAGA 
TCTGGGGCAA CGGCGCCGAT GACATCATCG ACGGCGACAA ATGGCTCAAC GTCCGGCTCA GCGTCCTCGA CGGACCGGGC GGCTCCGAGA TCAGGACCGC GACGTCGCTG ACCGAACTCC AGGCGGACAT CATGGCCGGG TCCATCGATC CCGGCAACGT GGAGATCGTC CGGGAAATCC TCGCCAGCCC TGGCGAGGCC GACGTCGACA CGGCAGTGTT CTCCGGCGCC CGGAATGAGT ACCAGATCAG CACTGTCGGG GGAGTCACCA CCGTGGCCCA CACCGGCGGA ACCGGCGCCG ACGGCACGGA CCGGATCACC AACGTTGAAC AGTTGAGGTT CACCGACCAG ACCGTGAACC TCGTTCCGGC TGTCATCCAG GCTCCGGCCG CTCCTCTGAT TGGAACGGCC GTGGCGGGAA ACACCTCGGC GACGGTGGCC TTCGCGGCGC CGGCAGGCGG GTTCCCCGCT GACTCCTTCA GCATTGTGGT GCGCACCGGC ACCACCGTGG TCAGGACCAT CGACGGAGTT CCCGGCAGCG CAACCAGCCA TCAGGTCACC GGACTGAGCA ACGGTACGGC CTACAACTTC CAGGTCCGGG CGGTCAATTC AGCCGGCGAA AGCCCGCTGT CGGCAGCATC CAACGAGGTG ACACCGCGCG TACCGGCCTA CGTGCCGCCG GCGGTATCAC CGTTTGCCGA TGTTTCCACC AACCAGCTCT ACTACCTGGA GATGGCGTGG CTGGCGGATC AGGGCATTTC CACCGGCTGG ACGGAAGCCA ACGGTACGGT GACCTACCGG CCGCTGCAGT CCATCAGCCG TGACGCCATG GCAGCCTTCC TCTACCGGAT GGCCGGCTCG CCGGAGTTCA CTGCCCCGGC GGTGTCGCCG TTCGCGGATG TGTCCACGGG CCAGCAGTTC TATAAGGAGA TGGCGTGGTT GGCGGATCAG GGCATTTCCA CGGGCTGGAC GGAGGCCAAC GGTACGGTGA CCTACCGGCC GCTGCAGTCC ATCAGCCGTG ACGCCATGGC AGCCTTCCTC TACCGGGCGG CCGGCTCGCC GGCGTTCACT GCCCCGGCGG TGTCGCCGTT CGCGGATGTG TCCACGGGCC AGCAGTTCTA CAAGGAGATG GCGTGGTTGG CGGACCAGGG CATTTCCACG GGCTGGACGG AGGCCAACGG TACGGTGACC TACCGGCCGC TGCAGCCGAT CAGCCGTGAC GCCATGGCAG CCTTCCTCTA CAGGATGAGC AACCGGGCGG CCGGCTAG
|
Protein sequence | MAKHRKSRKG DGTPHPQGPA AGVTRKVTAG ALALLIGAGV VPSVLAPAAS AAPTGQGFTL NAGDMRFILK QIKIAENNAT REDAQGNDVP GQPLLGDGPN QVASPLLPYG LRTVDGTDNN LVAGQSGYGS ASREFPRLSD PEWRTSSGGQ NYESVNANVT DNGPRFVSNV IVDQTATNPA AVAAAGKAHR TVNDGPTAVP CDGNGLPENC VPEGETLDIP NVTTDFGLSP PYNGMFALFG QFFDHGVDFT KKTKNYVMMP LSPDDPLYVP GGRTNFMLLN RAENQPGPDG VLGTADDEQN ATNTDSPWVD QSQTYSSHSS HQVFLREYTL NENGDPVSTG ELIEGEPGGM ATWARIKEQA RTMLGLELSD VDVADIPKLA TDQYGRFLRG PNGLPQYETA TGRVEGNLAA PVAPPANVER IGIAFLDDIG HNAAPFNSQT GAPLQPDDDE DVNGVNEPRP AGRYDDEMLD KHFVAGDGRV NENIGLTAIH QVFHSEHNRL VGYMEELLTS QNLDLNEWKL PNGQWNGERL FQAARYVTEM EYQHIVFEDF ARKIQPGING FNVFTQSDTG IDPAIQAEFA HATYRFGHSM LTETVDRKLN DGTDIGMPLL DAFLNPPAYY ESTAGTLNPK QAAGAIAMGM TDQVGAELDE FVTDTLRNNV LGLPLDLASL NLARGRDTGI PSLNNFRTQL YASTGESSLK PYTSWVDFGQ NLKHPDSVVN FMAAYGTHET ITAAASITDK RAAAQRLFDM DTADPATPAD SYDFVNSAGA WASQPSGLNN VDLWVGGLAE RQNLFGGLLG STFNYIFERQ MTDLQDGDRL YYLSRTSGLN LRTQLEGNSL AELIMRNTDA EALKADVFGV ADCEFELGRI TAGTGNSVAD DPASACDESA LLMRMSDGTI RYRVSNTVDR PGLNAQSTFN GTALGDRIWG GIDNDTFWGN DGQDIIEGND GADTVLGGDG NDRITDSHGD DVLKGGNGND AIDAGPGLDI IMSGDGDDFS NGGLNGNETF AGEGNDLVLA GDGPDTVFGG GGDDWQEGGN SNDLLQGDSG APFFDDINAP GHDVLTGGSG EDDYDAEGGD DVMVAGPGIE RNHGVFGFDW VTHARSVEPA DSDMRQIIVD GPNALKDRFL LVEALSGWDK DDVLRGDDEV PSVPNTEVNV EGLSNELDAA GIARISGLAG LLPAGATTFG AGNIIIGGSG SDQIWGNGAD DIIDGDKWLN VRLSVLDGPG GSEIRTATSL TELQADIMAG SIDPGNVEIV REILASPGEA DVDTAVFSGA RNEYQISTVG GVTTVAHTGG TGADGTDRIT NVEQLRFTDQ TVNLVPAVIQ APAAPLIGTA VAGNTSATVA FAAPAGGFPA DSFSIVVRTG TTVVRTIDGV PGSATSHQVT GLSNGTAYNF QVRAVNSAGE SPLSAASNEV TPRVPAYVPP AVSPFADVST NQLYYLEMAW LADQGISTGW TEANGTVTYR PLQSISRDAM AAFLYRMAGS PEFTAPAVSP FADVSTGQQF YKEMAWLADQ GISTGWTEAN GTVTYRPLQS ISRDAMAAFL YRAAGSPAFT APAVSPFADV STGQQFYKEM AWLADQGIST GWTEANGTVT YRPLQPISRD AMAAFLYRMS NRAAG
|
| |
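The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (the record does not state this, but the listed values are consistent with it) and that the CDS is unspliced and ends in a stop codon. The function names `gene_length`, `protein_length`, and `ungroup` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def ungroup(seq: str) -> str:
    """Join a sequence that is displayed in space-separated 10-character blocks."""
    return "".join(seq.split())


# Values copied from the record above.
length = gene_length(1366424, 1371301)
print(length)                  # 4878
print(protein_length(length))  # 1625

# First two blocks of the gene sequence, joined into one string.
print(ungroup("ATGGCTAAAC ATCGCAAGTC"))  # ATGGCTAAACATCGCAAGTC
```

Both computed values agree with the Gene Length (4878 bp) and Protein Length (1625 aa) fields listed in the record.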