Gene Arth_1243 details


Gene Information

Locus tag: Arth_1243
Symbol:
ID: 4446272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1366424
End bp: 1371301
Gene Length: 4878 bp
Protein Length: 1625 aa
Translation table: 11
GC content: 66%
IMG OID: 639689051
Product: heme peroxidase
Protein accession: YP_830737
Protein GI: 116669804
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2931] RTX toxins and related Ca2+-binding proteins
TIGRFAM ID:
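
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function name expected_lengths is hypothetical and is not part of any database API.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop codon
    return gene_len, protein_len


# Values taken from the record for Arth_1243
gene_len, protein_len = expected_lengths(1366424, 1371301)
print(gene_len, protein_len)  # 4878 1625
```

Under these assumptions the result agrees with the listed Gene Length (4878 bp) and Protein Length (1625 aa).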


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.941188
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAAC ATCGCAAGTC CCGAAAGGGG GACGGCACGC CTCATCCGCA GGGCCCGGCG 
GCCGGCGTGA CCCGCAAGGT CACCGCCGGT GCGCTGGCCC TTCTGATCGG GGCGGGGGTT
GTCCCCTCAG TTCTGGCACC AGCCGCTTCC GCGGCTCCGA CGGGCCAGGG ATTCACCCTC
AATGCCGGCG ATATGCGGTT CATTCTCAAG CAGATCAAAA TCGCCGAAAA CAACGCCACC
AGAGAGGACG CACAGGGGAA CGACGTTCCC GGCCAGCCGC TCCTGGGCGA CGGCCCCAAC
CAGGTGGCCA GCCCGTTGCT GCCTTACGGC CTGCGGACCG TTGACGGTAC CGACAACAAC
CTGGTGGCCG GCCAAAGCGG CTACGGCTCC GCCAGCCGTG AGTTCCCCCG GTTGTCGGAC
CCGGAATGGC GCACGTCGTC CGGCGGACAG AACTACGAAT CCGTCAACGC CAACGTCACC
GATAACGGCC CGCGCTTCGT CAGCAACGTC ATTGTGGACC AGACCGCAAC CAACCCCGCA
GCTGTAGCCG CGGCCGGCAA GGCGCACCGA ACGGTCAACG ACGGCCCCAC GGCCGTGCCC
TGCGACGGCA ACGGACTTCC GGAAAACTGC GTGCCCGAGG GAGAGACCCT CGACATTCCC
AACGTCACCA CGGACTTCGG CCTTTCGCCG CCCTACAACG GCATGTTCGC GCTCTTTGGC
CAGTTCTTCG ACCACGGCGT GGATTTCACC AAGAAGACCA AGAACTACGT CATGATGCCG
TTGTCTCCGG ACGACCCGTT ATATGTTCCG GGCGGGCGCA CCAACTTCAT GCTGCTGAAC
AGGGCGGAGA ACCAGCCCGG TCCTGACGGC GTCCTGGGAA CAGCCGACGA CGAGCAGAAC
GCCACCAACA CGGACTCTCC GTGGGTGGAC CAGAGCCAGA CGTATTCCTC GCATTCGTCC
CACCAGGTGT TCCTGCGCGA ATACACCCTG AATGAAAACG GCGATCCCGT GTCCACCGGC
GAGCTCATCG AGGGCGAACC CGGCGGCATG GCAACATGGG CCCGTATCAA GGAACAGGCC
CGGACGATGC TGGGGCTTGA ACTGTCGGAC GTTGACGTGG CGGACATTCC CAAGCTGGCC
ACCGACCAGT ACGGCCGGTT CCTGCGCGGG CCGAACGGAC TCCCGCAGTA CGAGACCGCT
ACCGGGCGCG TTGAAGGAAA CCTGGCGGCT CCGGTGGCAC CGCCGGCCAA CGTGGAACGG
ATCGGCATCG CCTTCCTGGA CGACATCGGC CACAACGCGG CGCCCTTCAA CTCCCAGACC
GGGGCCCCGC TGCAGCCGGA TGACGACGAG GACGTCAACG GCGTCAACGA GCCGCGCCCG
GCCGGCCGCT ATGACGATGA AATGCTGGAC AAGCACTTCG TTGCCGGTGA CGGCCGCGTC
AACGAGAACA TCGGCCTGAC GGCGATCCAC CAGGTGTTCC ACTCCGAGCA CAACCGCCTG
GTGGGCTACA TGGAGGAGCT TCTCACTTCA CAGAACCTCG ACCTCAACGA ATGGAAGCTT
CCCAACGGGC AGTGGAACGG TGAACGGCTG TTCCAGGCGG CACGCTATGT GACCGAGATG
GAGTACCAGC ACATCGTGTT CGAGGACTTT GCCCGCAAGA TCCAGCCCGG CATCAACGGT
TTCAACGTCT TTACGCAGTC GGACACCGGC ATCGACCCCG CCATCCAGGC GGAGTTTGCC
CACGCCACCT ACCGGTTCGG CCACTCGATG CTCACCGAAA CCGTTGACCG CAAGCTCAAC
GACGGCACGG ATATCGGGAT GCCGCTGCTG GACGCGTTCC TCAACCCGCC GGCCTACTAC
GAGAGCACTG CGGGAACCCT GAATCCCAAG CAGGCAGCCG GTGCCATCGC CATGGGGATG
ACGGACCAGG TTGGTGCGGA ACTCGACGAG TTCGTGACCG ACACCCTGCG GAACAACGTC
CTCGGCCTGC CCCTGGACCT TGCCTCGCTG AACCTTGCCC GCGGCCGGGA CACGGGCATC
CCGTCCCTGA ACAACTTCCG CACCCAGCTG TACGCCAGCA CCGGCGAGTC CTCGCTGAAG
CCCTACACCA GCTGGGTGGA TTTCGGCCAG AATCTGAAGC ACCCGGATTC CGTGGTCAAC
TTCATGGCCG CTTACGGCAC CCATGAGACG ATCACCGCGG CAGCGTCCAT CACGGACAAG
CGCGCAGCGG CCCAGCGGCT GTTCGACATG GACACCGCTG ACCCTGCCAC TCCCGCGGAC
TCCTACGACT TCGTCAACAG CGCCGGAGCC TGGGCATCGC AGCCCAGCGG CCTGAACAAC
GTGGACCTGT GGGTGGGCGG CCTCGCGGAG CGCCAGAACC TCTTCGGCGG ACTCCTGGGC
TCCACGTTCA ACTACATCTT TGAACGCCAG ATGACGGACC TGCAGGACGG GGACCGGCTG
TACTACCTGT CGCGCACCTC GGGACTGAAC CTCCGCACGC AGCTGGAAGG CAACTCGCTG
GCAGAGCTGA TCATGCGCAA CACGGATGCG GAAGCCCTCA AGGCGGACGT CTTCGGCGTG
GCGGACTGCG AGTTCGAGCT CGGCCGCATC ACCGCAGGCA CCGGAAACTC CGTGGCGGAC
GACCCGGCCT CCGCCTGCGA CGAATCCGCC CTCCTCATGA GGATGTCGGA CGGCACGATT
CGGTACCGCG TCAGCAACAC CGTGGACCGT CCGGGCCTCA ACGCCCAGTC CACCTTCAAC
GGCACCGCTC TCGGTGACCG GATCTGGGGC GGCATCGACA ACGATACGTT CTGGGGCAAC
GACGGCCAGG ACATCATCGA AGGCAATGAC GGTGCCGACA CTGTCCTTGG CGGTGACGGA
AACGACCGGA TCACGGACTC GCACGGCGAC GACGTCCTGA AGGGCGGCAA CGGAAACGAC
GCCATCGACG CCGGCCCGGG CCTGGACATC ATCATGTCCG GAGATGGCGA CGACTTCTCC
AACGGCGGCC TCAACGGCAA CGAGACGTTC GCCGGCGAAG GCAACGACCT GGTCCTTGCC
GGCGACGGAC CGGACACGGT ATTCGGCGGC GGGGGCGATG ACTGGCAGGA AGGCGGTAAC
TCCAACGACC TGCTGCAGGG CGATAGCGGC GCACCGTTCT TCGACGACAT CAACGCTCCG
GGCCATGACG TCCTGACCGG CGGTTCCGGC GAAGACGACT ACGACGCCGA AGGCGGCGAC
GACGTGATGG TCGCCGGCCC CGGCATCGAA CGCAACCACG GAGTGTTCGG CTTCGACTGG
GTTACGCACG CACGCTCTGT CGAACCGGCC GATTCGGACA TGCGCCAGAT CATCGTCGAC
GGCCCGAACG CCCTCAAGGA CCGTTTCCTC CTGGTCGAGG CGCTCTCCGG CTGGGACAAG
GACGACGTTC TCCGCGGCGA CGACGAGGTG CCGTCCGTGC CCAACACCGA GGTCAACGTC
GAAGGCCTCA GCAACGAACT GGACGCAGCC GGTATTGCCC GGATCAGCGG CCTGGCCGGC
CTGCTGCCTG CGGGTGCCAC CACCTTCGGA GCCGGCAACA TCATCATTGG CGGCTCCGGC
AGCGACCAGA TCTGGGGCAA CGGCGCCGAT GACATCATCG ACGGCGACAA ATGGCTCAAC
GTCCGGCTCA GCGTCCTCGA CGGACCGGGC GGCTCCGAGA TCAGGACCGC GACGTCGCTG
ACCGAACTCC AGGCGGACAT CATGGCCGGG TCCATCGATC CCGGCAACGT GGAGATCGTC
CGGGAAATCC TCGCCAGCCC TGGCGAGGCC GACGTCGACA CGGCAGTGTT CTCCGGCGCC
CGGAATGAGT ACCAGATCAG CACTGTCGGG GGAGTCACCA CCGTGGCCCA CACCGGCGGA
ACCGGCGCCG ACGGCACGGA CCGGATCACC AACGTTGAAC AGTTGAGGTT CACCGACCAG
ACCGTGAACC TCGTTCCGGC TGTCATCCAG GCTCCGGCCG CTCCTCTGAT TGGAACGGCC
GTGGCGGGAA ACACCTCGGC GACGGTGGCC TTCGCGGCGC CGGCAGGCGG GTTCCCCGCT
GACTCCTTCA GCATTGTGGT GCGCACCGGC ACCACCGTGG TCAGGACCAT CGACGGAGTT
CCCGGCAGCG CAACCAGCCA TCAGGTCACC GGACTGAGCA ACGGTACGGC CTACAACTTC
CAGGTCCGGG CGGTCAATTC AGCCGGCGAA AGCCCGCTGT CGGCAGCATC CAACGAGGTG
ACACCGCGCG TACCGGCCTA CGTGCCGCCG GCGGTATCAC CGTTTGCCGA TGTTTCCACC
AACCAGCTCT ACTACCTGGA GATGGCGTGG CTGGCGGATC AGGGCATTTC CACCGGCTGG
ACGGAAGCCA ACGGTACGGT GACCTACCGG CCGCTGCAGT CCATCAGCCG TGACGCCATG
GCAGCCTTCC TCTACCGGAT GGCCGGCTCG CCGGAGTTCA CTGCCCCGGC GGTGTCGCCG
TTCGCGGATG TGTCCACGGG CCAGCAGTTC TATAAGGAGA TGGCGTGGTT GGCGGATCAG
GGCATTTCCA CGGGCTGGAC GGAGGCCAAC GGTACGGTGA CCTACCGGCC GCTGCAGTCC
ATCAGCCGTG ACGCCATGGC AGCCTTCCTC TACCGGGCGG CCGGCTCGCC GGCGTTCACT
GCCCCGGCGG TGTCGCCGTT CGCGGATGTG TCCACGGGCC AGCAGTTCTA CAAGGAGATG
GCGTGGTTGG CGGACCAGGG CATTTCCACG GGCTGGACGG AGGCCAACGG TACGGTGACC
TACCGGCCGC TGCAGCCGAT CAGCCGTGAC GCCATGGCAG CCTTCCTCTA CAGGATGAGC
AACCGGGCGG CCGGCTAG
 
Protein sequence
MAKHRKSRKG DGTPHPQGPA AGVTRKVTAG ALALLIGAGV VPSVLAPAAS AAPTGQGFTL 
NAGDMRFILK QIKIAENNAT REDAQGNDVP GQPLLGDGPN QVASPLLPYG LRTVDGTDNN
LVAGQSGYGS ASREFPRLSD PEWRTSSGGQ NYESVNANVT DNGPRFVSNV IVDQTATNPA
AVAAAGKAHR TVNDGPTAVP CDGNGLPENC VPEGETLDIP NVTTDFGLSP PYNGMFALFG
QFFDHGVDFT KKTKNYVMMP LSPDDPLYVP GGRTNFMLLN RAENQPGPDG VLGTADDEQN
ATNTDSPWVD QSQTYSSHSS HQVFLREYTL NENGDPVSTG ELIEGEPGGM ATWARIKEQA
RTMLGLELSD VDVADIPKLA TDQYGRFLRG PNGLPQYETA TGRVEGNLAA PVAPPANVER
IGIAFLDDIG HNAAPFNSQT GAPLQPDDDE DVNGVNEPRP AGRYDDEMLD KHFVAGDGRV
NENIGLTAIH QVFHSEHNRL VGYMEELLTS QNLDLNEWKL PNGQWNGERL FQAARYVTEM
EYQHIVFEDF ARKIQPGING FNVFTQSDTG IDPAIQAEFA HATYRFGHSM LTETVDRKLN
DGTDIGMPLL DAFLNPPAYY ESTAGTLNPK QAAGAIAMGM TDQVGAELDE FVTDTLRNNV
LGLPLDLASL NLARGRDTGI PSLNNFRTQL YASTGESSLK PYTSWVDFGQ NLKHPDSVVN
FMAAYGTHET ITAAASITDK RAAAQRLFDM DTADPATPAD SYDFVNSAGA WASQPSGLNN
VDLWVGGLAE RQNLFGGLLG STFNYIFERQ MTDLQDGDRL YYLSRTSGLN LRTQLEGNSL
AELIMRNTDA EALKADVFGV ADCEFELGRI TAGTGNSVAD DPASACDESA LLMRMSDGTI
RYRVSNTVDR PGLNAQSTFN GTALGDRIWG GIDNDTFWGN DGQDIIEGND GADTVLGGDG
NDRITDSHGD DVLKGGNGND AIDAGPGLDI IMSGDGDDFS NGGLNGNETF AGEGNDLVLA
GDGPDTVFGG GGDDWQEGGN SNDLLQGDSG APFFDDINAP GHDVLTGGSG EDDYDAEGGD
DVMVAGPGIE RNHGVFGFDW VTHARSVEPA DSDMRQIIVD GPNALKDRFL LVEALSGWDK
DDVLRGDDEV PSVPNTEVNV EGLSNELDAA GIARISGLAG LLPAGATTFG AGNIIIGGSG
SDQIWGNGAD DIIDGDKWLN VRLSVLDGPG GSEIRTATSL TELQADIMAG SIDPGNVEIV
REILASPGEA DVDTAVFSGA RNEYQISTVG GVTTVAHTGG TGADGTDRIT NVEQLRFTDQ
TVNLVPAVIQ APAAPLIGTA VAGNTSATVA FAAPAGGFPA DSFSIVVRTG TTVVRTIDGV
PGSATSHQVT GLSNGTAYNF QVRAVNSAGE SPLSAASNEV TPRVPAYVPP AVSPFADVST
NQLYYLEMAW LADQGISTGW TEANGTVTYR PLQSISRDAM AAFLYRMAGS PEFTAPAVSP
FADVSTGQQF YKEMAWLADQ GISTGWTEAN GTVTYRPLQS ISRDAMAAFL YRAAGSPAFT
APAVSPFADV STGQQFYKEM AWLADQGIST GWTEANGTVT YRPLQPISRD AMAAFLYRMS
NRAAG